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(57) Abstract 



A gene (cDNA) encoding a bovine myostatin protein. The nucleic acid coding sequence is identified as SEQ ID NO:l and the protein 
sequence is identified as SEQ ID NO: 2. A mutant gene (SEQ ID NO: 3) in which the coding sequence lacks an 1 1-base pair consecutive 
sequence (SEQ ID NO:l 1) of the sequence encoding bovine protein having myostatin activity has been sequenced. It has been shown that 
cattle of the Belgian Blue breed homozygous for the mutant gene lacking myostatin activity are double-muscled. A method for determining 
the presence of muscular hyperplasia in a mammal is described. The method includes obtaining a sample of material containing DNA from 
the mammal and ascertaining whether a sequence of the DNA encoding (a) a protein having biological activity of myostatin, is present, and 
whether a sequence of the DNA encoding (b) an allelic protein lacking the activity of (a), is present. The absence of (a) and the presence 
of (b) indicates the presence of muscular hyperplasia in the mammal. 
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MUTATIONS IN THE MYOSTATIN GENE CAUSE DOUBLE-MUSCLING IN MAMMALS 
Field of The Invention 

This invention relates to factors affecting muscle development in mammals, 
especially livestock. In particular, this invention relates to the cloning of the myostatin gene, a 
5 member of the TGF-p superfamily, its involvement in muscular hyperplasia in livestock, and a 
method for determining myostatin genotypes. 

Description Of Related Art 

The TGF-p superfamily consists of a group of multifunctional polypeptides which 
control a wide range of differentiation processes in many mammalian cell types. GDF-8 is a 

1 0 member of the TGF-p superfamily. All members of this superfamily share a common structure 
including a short peptide signal for secretion and an N-terminat peptide fragment that is 
separated from the bioactive carboxy-terminal fragment by proteolytic cleavage at a highly 
conserved proteolytic cleavage site. The bioactive carboxy-terminal domain is characterized by 
cysteine residues at highly conserved positions which are involved in intra- and intermolecular 

15 disulfide bridges. The functional molecules are covalently linked (via a S-S bond) dimers of the 
carboxy-terminal domain (Masterson e/a/., 1996). 

Recently, it was reported that mice deficient in the gene coding for GDF-8 were 
characterized by a generalized muscular hyperplasia (McPherron et a/. f 1997). The GDF-8 
deficient mice were produced by gene targeting using homologous recombination in embryonic 

20 stem cells, a method referred to as "gene knock-out". The murine generalized muscular 
hyperplasia appeared to be very similar in its expression to the muscular hyperplasia 
characterizing "double-muscled" cattle. This observation raised the intriguing possibility that the 
bovine gene coding for GDF-8 (i.e. the bovine evolutionary homologue of the mouse GDF-8 
gene) is involved in the bovine double-muscling phenotype. It also raised the possibility that the 

25 human gene coding for GDF-8 (i.e. the human evolutionary homologue of the mouse GDF-8 
gene) is involved in regulating muscular development in humans, specifically skeletal muscle 
genesis. Isolation of the human GDF-8 gene may have therapeutic uses/applications in the 
treatment of musculodegenerative diseases through upgrading or downgrading the expression of 
GDF-8. 

30 The occurrence of animals characterized by a distinct generalized muscular 

hypertrophy, commonly known as "double-muscled" animals, has been reported in several cattle 
breeds around the world. The first documented description of double-muscled cattle dates back 
as early as 1 807 (Culley, 1 807). One of the breeds in which this characteristic has been most 
thoroughly analyzed is the Belgian Blue Cattle Breed ("Belgian Blue Breed"). This is one of the 

35 only breeds where the double-muscled trait has been systematically selected for, and where the 
double-muscled phenotype is virtually fixed. A comparison of double-muscled and conventional 
animals within the Belgian Blue Breed, showed an increase in muscle mass by 20% on average, 
while all other organs were reduced in size (Hanset, 1986 and 1991). The muscular hypertrophy 
was shown to be an histological hyperplasia affecting primarily superficial muscles, 

40 accompanied by a 50% reduction in total lipid content and a reduction in connective tissue 
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fraction as measured by hydroxyprofine content (Hanset et a/. ( 1982). Double-muscled animals 
were shown to have a reduced feed intake with improved feed conversion ratio (Hanset et a/., 
1987). An important economic benefit of double-muscled animals, in contrast to conventional 
animals, is the substantia! increase in selling price and net income for the farmer (Hanset et e/., 
5 1987). 

One of the most thorough series of studies on double-muscling is that of Hanset 
and colleagues in the Belgian Blue Breed. Objective criteria of muscular development, such as 
dressing-out percentage, lean and fat percentage, plasma and red cell creatine and creatinine 
concentrations, were measured on nearly 150 randomly selected animals raised in standardized 
10 conditions. These studies clearly revealed abnormal, bimodal distributions of the double- 
muscled phenotype and objectively confirmed the visual classification traditionally performed by 
breeders on double-muscled and conventional animals. The phenotypic distribution was 
resolved using a maximum likelihood procedure into two component normal populations with a 
common variance which revealed mean differences of three to four standard deviations 
15 depending on the trait. This suggested the presence of an allele having a major effect on 

muscular development with a population frequency close to 50% (Hanset and Michaux, 1985b). 
The most convincing evidence in favour of such an allele, however, came from experimental 
crosses involving double-muscled Belgian Blue sires and Holstein Friesian dairy cows (the latter 
animals having very poor muscular development). While F1 offspring showed a phenotypic 
20 distribution very similar to their Holstein Friesian dams, backcrossing these F1's to double- 
muscled sires produced a bimodal BC generation, clearly pointing towards the Mendelian 
segregation of a recessive a mh" (muscular hypertrophy) allele (Hanset and Michaux., 1 985a). 

The same kind of experimental crosses were subsequently used to perform a 
whole genome scan using a microsatellite based marker map. To perform the linkage analysis, 
25 animals were classified as double-muscled or conventional. Very significant Logarithm of the 
Odds scores (lodscores) were obtained on chromosome 2 (> 17), and multi point linkage 
analysis positioned the mh locus at the centromeric end of this chromosome, at [2]centimorgan 
from the nearest microsatellite marker: TGLA44. The corresponding chromosomal region 
accounted for all the variance of the trait assumed to be fully penetrant in this experiment 
30 (Charlierefa/., 1995). 

In humans, genes coding for some forms of muscular abnormalities have been 
isolated, e.g. muscular dystrophy. The present invention provides for the gene which regulates 
the development of skeletal muscle only, as opposed to other types of muscle, e.g. smooth or 
cardiac muscle. The present invention may provide an understanding of the role of the GDF-8 
35 gene or its receptor in the regrowth of skeletal muscle in humans which only undergo a 
hyperplasic response. 
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Summary of the Invention 

The present inventors have identified and sequenced a gene (cDNA and 
genomic) encoding a bovine myostatin protein. The nucleic acid coding sequence is identified as 
SEQ ID NO:1 and the protein sequence is identified as SEQ ID NO:2. The genomic bovine 
5 sequence is identified as SEQ ID NO:54. A mutant gene (SEQ ID NO:3) in which the coding 
sequence lacks an 1 1-base pair consecutive sequence (SEQ ID NO:1 1) of the sequence 
encoding bovine protein having myostatin activity has been sequenced. It has been shown that 
cattle of the Belgian Blue breed homozygous for the mutant gene lacking myostatin activity are 
double-muscled. Other bovine mutations which lead to double-muscling in have also been 
10 determined, being identified herein as nt419(del7-ins10) t Q204X, E226X and C313Y, 
respectively. 

In one aspect, the present invention thus provides a method for determining the 
presence of muscular hyperplasia in a mammal. The method includes obtaining a sample of 
materia! containing DNA from the mammal and ascertaining whether a sequence of the DNA 
15 encoding (a) a protein having biological activity of myostatin, is present, and whether a sequence 
of the DNA encoding (b) an allelic protein lacking the activity of (a), is present. The absence of 
(a) and the presence of (b) indicates the presence of muscular hyperplasia in the mammal. 

Of course, the mutation responsible for the lack of activity can be a naturally 
occurring mutation, as is the case for the Belgian Blue, Asturiana, Parthenaise or Rubia Gallega 
20 breeds, shown here. 

The mammal can be a human, bovine, etc. 

There are several methods known for determining whether a particular 
nucleotide sequence is present in a sample. A common method is the polymerase chain 
reaction. A preferred aspect of the invention thus includes a step in which ascertaining whether a 

25 sequence of the DNA encoding (a) is present, and whether a sequence of the DNA encoding (b) 
is present includes amplifying the DNA in the presence of primers based on a nucleotide 
sequence encoding a protein having biological activity of myostatin. 

A primer of the present invention, used in PCR for example, is a nucleic acid 
molecule sufficiently complementary to the sequence on which it is based and of sufficient length 

30 to selectively hybridize to the corresponding portion of a nucleic acid molecule intended to be 
amplified and to prime synthesis thereof under in vitro conditions commonly used in PCR. 
Likewise, a probe of the present invention, is a molecule, for example a nucleic acid molecule of 
sufficient length and sufficiently complementary to the nucleic acid molecule of interest, which 
selectively binds under high or low stringency conditions with the nucleic acid sequence of 

35 interest for detection thereof in the presence of nucleic acid molecules having differing 
sequences. 

In preferred aspects, primers are based on the sequence identified as SEQ ID 
NO:7 (human cDNA sequence) or SEQ ID NO:54. 
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In another aspect, the invention is a method for determining the presence of 
muscular hyperplasia in a mammal which includes obtaining a sample of material containing 
mRNA from the mammal. Such method includes ascertaining whether a sequence of the 
mRNA encoding (A) a protein having biological activity of myostatin, is present and whether a 
5 sequence of the mRNA encoding (B) a protein at least partially encoded by a truncated 

nucleotide sequence corresponding to substantially the sequence of the mRNA and lacking the 
activity of (A), is present The absence of (A) and the presence of (B) indicates the presence of 

muscular hyperplasia in the mammal. 

The mRNA encoding (A) and the truncated sequence can correspond to alleles 

10 of DNA of the mammal. 

Again, if an amplification method such as PCR is used in ascertaining whether a 
sequence of the mRNA encoding (A) is present, and whether a sequence of the mRNA encoding 
(B) is present, the method includes amplifying the mRNA in the presence of a pair of primers 
complementary to a nucleotide sequence encoding a protein having biological activity of 

15 myostatin. Each such primer can contain a nucleotide sequence substantially complementary, 
for example, to the sequence identified as SEQ ID NO:7. The truncated sequence can contain at 
least 50 consecutive nucleotides substantially corresponding to 50 consecutive nucleotides of 

SEQ ID NO:7, for example. 

In another aspect, the invention is a method for determining the presence of 
20 muscular hyperplasia in a mammal which includes obtaining a tissue sample of containing 
mRNA of the mammal and ascertaining whether an mRNA encoding a mutant type myostatin 
protein lacking biological activity of myostatin is present. The presence of such an mRNA 
encoding a mutant type myostatin protein indicates the presence of muscular hyperplasia in the 
mammal. 

25 )n another aspect, the invention thus provides a method for determining the 

presence of muscular hyperplasia in a bovine animal. The method includes obtaining a sample 
of material containing DNA from the animal and ascertaining whether DNA having a nucleotide 
sequence encoding a protein having biological activity of myostatin is present. The absence of 
DNA having such a nucleotide sequence indicates the presence of muscular hyperplasia in the 

30 animal. Ascertaining whether DNA having a nucleotide sequence encoding a protein having 
biological activity of myostatin can include amplifying the DNA in the presence of primers based 
on a nucleotide sequence encoding a protein having biological activity of myostatin. 

In particular, the method can be carried out using a sample from an animal in 
which such a bovine animal not displaying muscular hyperplasia is known to have a nucleotide 

35 sequence which is capable of hybridizing with a nucleic acid molecule having the sequence 
identified as SEQ ID NO:1 under stringent hybridization conditions. 

It is possible that ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the DNA 
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in the presence of primers based on a nucleotide sequence encoding the N-terminal and the C- 
terminal, respectively, of the protein having biological activity of myostatin. 

Primers, say first and second primers, can be based on first and second 
nucleotide sequences encoding spaced apart regions of the protein, wherein the regions flank a 
5 mutation known to naturally occur and which when present in both alleles of a such an animal 
results in muscular hyperplasia. 

It can also be that DNA of such an animal not displaying muscular hyperplasia 
contains a nucleotide sequence which hybridizes under stringent conditions with a nucleotide 
sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the coding 

10 sequence of DNA of a such an animal displaying muscular hyperplasia is known to contain an 
1 1-base pair deletion beginning at base pair no. 821 of the coding sequence, and said first primer 
is selected to be upstream of the codon encoding glutamic acid no. 275 and the second primer 
is selected to be downstream of the codon encoding aspartic acid no. 274. 

Also, a DNA of such an animal not displaying muscular hyperplasia might 

15 contain a nucleotide sequence which hybridizes under stringent conditions with a nucleotide 
sequence encoding a protein having a sequence identified as SEQ ID NO:2. The coding 
sequence of DNA of such an animal displaying muscular hyperplasia might be known to contain 
an 1 1-base pair deletion beginning at base pair no. 821 . A primer can be selected to span the 
nucleotide sequence including base pair nos. 820 and 821 of the DNA sequence containing the 

20 deletion. 

The animal can be of the Belgian Blue breed. 

In a particular aspect, ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the DNA 
in the presence of a primer containing at least a portion of a mutation known to naturally occur 
25 and which when present in both alleles of a said animal results in muscular hyperplasia. 

In another aspect, the invention is a method for determining the presence of 
muscular hyperplasia in a bovine animal which includes obtaining a sample of the animal 
containing mRNA and ascertaining whether an mRNA encoding a protein having biological 
activity of myostatin is present in the sample. The absence of said mRNA indicates the presence 
30 of muscular hyperplasia in the animal. 

A sample containing mRNA can be muscle tissue, particularly, skeletal muscle 

tissue. 

In a particular aspect, the invention is a method for determining the presence of 
double muscling in a bovine animal, involving obtaining a sample of material containing DNA 
35 from the animal and ascertaining whether the DNA contains the nucleotide sequence identified 
as SEQ ID NO:11 in which the absence of the sequence indicates double muscling in the animal. 

In a particular aspect, the animal is of the Belgian Blue breed. 

In another aspect, the invention is a method for determining the myostatin 
genotype of a mammal, as may be desirable to know for breeding purposes. The method 
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includes obtaining a sample of materia, containing nucleic acid of the mammal, wherein the 
nucleic acid is uncontaminated by heterologous nucleic acid; ascertaining whether the sample 
contains a (i) nucleic acid molecule encoding a protein having biological activity of myostatin; and 
ascertaining whether the sample contains an (ii> allelic nucleic acid molecule encoding a P rote,n 
5 lacking biological activity of myostatin. The mammal can be bovine. 

In another aspect, the subject is human and (0 includes a nucleic acid sequence 
substantially homologous 0" the sense of identity) with the sequence identified as SEQ ID NO:7. 

The invention includes a method of increasing muscle mass of a mammal 
having muscle cells in which myostatin is expressed, the method comprising administering to the 
1 o mammal an effective amount of a nucleic acid mo.ecule substantially complementary to at least 
a portion of mRNA encoding the myostatin and being of sufficient length to sufficiently reduce 
expression of the myostatin to increase the muscle mass. In a particular aspect, the mammal s 
bovine. 

In another embodiment, the invention is a method of increasing muscle mass of 
15 a mammal, including administering to the mammal an effective amount of a nucleic acid 

molecule having ribozyme activity and a nucleotide sequence substantially complementa.y to at 
least a portion of mRNA encoding myostatin and being of sufficient length to bind selectively 
thereto to sufficiently reduce expression of the myostatin so as to increase the muscle mass. 

The invention includes a diagnostic kit, for determining the presence of muscular 
20 hyperplasia in a mammal from which a sample containing DNA of the mammal has been 
obtained The kit includes first and second primers for amplifying the DNA, the primers be,ng 
complementary to nucleotide sequences of the DNA upstream and down stream, respectively, of 
a mutation in the portion of the DNA encoding myostatin which results in muscular hyperplas.a of 
the mammal, wherein at least one of the nucleotide sequences is selected to be from a non- 
25 coding region of the myostatin gene. The kit can also includes a third primer complementary to a 
naturally occurring mutation of a coding portion of the myostatin gene. 

A particular diagnostic kit, for determining the genotype of a sample of 
mammalian genetic material, particularly bovine material includes a pair of primers for amplifying 
a portion of the genetic material corresponding to a nucleotide sequence which encodes at least 
30 a portion of a myostatin protein, wherein a first of the primers includes a nucleotide sequence 
sufficiently complementary to a mutation of SEQ ID NO:1 to prime amplification of a nucleic acid 
molecule containing the mutation, the mutation being selected from the group of mutations 
resulting from: (a) deletion of 11 nucleotides beginning at nucleotide 821 of the coding portion of 
SEQ ID NO:1 ; (b) deletion of 7 nucleotides beginning at nucleotide 419 of the coding sequence 
35 and insertion of the sequence AAGCATACAA in place thereof; (c) deletion of nucleotide 204 of 
the coding sequence and insertion of T in place thereof; (d) deletion of nucleotide 226 of the 
coding sequence and insertion of T in place thereof; and (e) deletion of nucleotide 313 of the 
coding sequence and insertion of A in place thereof; and combinations thereof. The second of 
the pair of primers is preferably located entirely upstream or entirely downstream of the selected 
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mutation or mutations. In one kit, a first said primer spans mutation (a) and further comprising a 
third primer which is sufficiently complementary to the nucleotide sequence identified as SEQ ID 
NO:1 1 to prime amplification of a nucleic acid molecule containing SEQ ID NO:1 1 . In another (or 
the same kit), a first said primer is sufficiently complementary to the inserted sequence of 
5 mutation (b) to prime amplification of a nucleic acid molecule containing mutation (b) and further 
comprising a third primer which is sufficiently complementary to the sequence corresponding to 
the 7 nucleotide deletion of mutation (b) to prime amplification of a nucleic acid molecule 
containing the 7 nucleotide deletion of mutation (b). In another (or the same kit), a first said 
primer spans mutation (c) and further comprising a third primer which is sufficiently 

10 complementary to the sequence spanning the corresponding region lacking mutation (c) to prime 
amplification of a nucleic acid molecule lacking mutation (c). In another (or the same kit), a first 
said primer spans mutation (d) and further comprising a third primer which is sufficiently 
complementary to the sequence spanning the corresponding region lacking mutation (d) to 
prime amplification of a nucleic acid molecule lacking mutation (d). In another (or the same kit), 

15 a first said primer spans mutation (e) and further comprising a third primer which is sufficiently 
complementary to the sequence spanning the corresponding region lacking mutation (e) to 
prime amplification of a nucleic acid molecule lacking mutation (e). 

The invention includes a purified protein having biological activity of myostatin, 
and having an amino acid sequence identified as SEQ ID NO:2, or a conservatively substituted 

20 variant thereof. The invention includes a purified bovine protein having biological activity of 
myostatin or a purified human protein (SEQ ID NO:8) having biological activity of myostatin. 

The invention includes an isolated nucleic acid molecule encoding a foregoing 
protein. Particularly, the invention includes an isolated nucleic acid molecule comprising a DNA 
molecule having the nucleotide sequence identified as SEQ ID NO:1 or SEQ ID NO:3 or SEQ ID 

25 NO:7 or which varies from the sequence due to the degeneracy of the genetic code, or a nucleic 
acid strand capable of hybridizing with at least one said nucleic acid molecule under stringent 
hybridization conditions. 

The invention includes isolated mRNA transcribed from DNA having a sequence 
which corresponds to a nucleic acid molecule of the invention. 

30 The invention includes isolated DNA in a recombinant cloning vector and a 

microbial cell containing and expressing heterologous DNA of the invention. 

The invention includes a transfected cell line which expresses a protein of the 

invention. 

The invention includes a process for producing a protein of the invention, 
35 including preparing a DNA fragment including a nucleotide sequence which encodes the protein; 
incorporating the DNA fragment into an expression vector to obtain a recombinant DNA molecule 
which includes the DNA fragment and is capable of undergoing replication; 
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transforming a host cell with the recombinant DNA molecule to produce a transformant which 
can express the protein; culturing the transformant to produce the protein; and recovering the 
protein from resulting cultured mixture. 

The invention includes a method of inhibiting myostatin so as to induce 

5 increased muscle mass in a mammal, comprising administering an effective amount of an 
antibody to myostatin to the mammal. 

The invention includes a method of increasing muscle mass in a mammal, by 
raising an autoantibody to the myostatin the in the mammal. Raising the autoantibody can 
include administering a protein having myostatin activity to the mammal. 

1 o The invention includes a method of increasing muscle mass in a mammal 

including administering to the mammal an effective amount of an antisense nucleic acid or 
oligonucleotide substantially complementary to at least a portion of the sequence identified as 
SEQ ID NO:1 or SEQ ID NO:5, or SEQ ID NO:7. The portion can be at least 5 nucleotide bases 
in length or longer. The mammal can be a bovine and the sequence can be that identified as 

15 SEQIDNO:1. 

The invention includes a method of inhibiting production of myostatin in a 
mammal in need thereof, including administering to the mammal an effective amount of an 
antibody to the myostatin. 

The invention includes a probe containing a nucleic acid molecule sufficiently 
20 complementary with a sequence identified as SEQ ID NO:1 , or its complement, so as to bind 
thereto under stringent conditions. The probe can be a sequence which is between about 8 and 
about 1195 nucleotides in length. 

The invention includes a primer composition useful for detection of the presence 
of DNA encoding myostatin in cattle. The composition can include a nucleic acid primer 
25 substantially complementary to a nucleic acid sequence encoding a bovine myostatin. The 
nucleic acid sequence can be that identified as SEQ ID NO:1 . 

The invention includes a method for identifying a nucleotide sequence of a 
mutant gene encoding a myostatin protein of a mammal displaying muscular hyperplasia. The 
method includes obtaining a sample of material containing DNA from the mammal and probing 
30 the sample using a nucleic acid probe based on a nucleotide sequence of a known gene 

encoding myostatin in order to identify nucleotide sequence of the mutant gene. In a particular 
approach, the probe is based on a nucleotide sequence identified as SEQ ID NO:1 , SEQ ID NO:5 
or SEQ ID NO;7. Preferably, the probe is at least 8 nucleic acids in length. The step of probing 
the sample can include exposing the DNA to the probe under hybridizing conditions and further 
35 comprising isolating hybridized nucleic acid molecules. The method can further include the step 
of sequencing isolated DNA. The method can include the step of isolating and sequencing a 
cDNA or mRNA encoding the complete mutant myostatin protein. The method can include a 
step of isolating and sequencing a functional wild type myostatin from the mammal not displaying 
muscular hyperplasia. 
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The method can include comparing the complete coding sequence of the 
complete mutant myostatin protein with, if the coding sequence for a functional wild type 
myostatin from such a mammal is previously known, (1) the known sequence, or if the coding 
sequence for a functional wild type myostatin from such a mammal is previously unknown, (2) 
5 the sequence determined according to claim 63 or claim 66, to determine the location of any 
mutation in the mutant gene. 

The invention includes a primer composition useful for the detection of a 
nucleotide sequence encoding a myostatin containing a first nucleic acid molecule based on a 
nucleotide sequence located upstream of a mutation determined according to a method of the 
10 invention and a second nucleic acid molecule based on a nucleotide sequence located 
downstream of the mutation. 

A probe of the invention can include a nucleic acid molecule based on a 
nucleotide sequence spanning a mutation determined according to the invention. 

The invention includes an antibody to a protein encoded by a nucleotide 
15 sequence identified as SEQ ID NO:1, SEQ ID NO:3 or SEQ ID NO:7, or other protein of the 
present invention. 

The invention includes a transgenic bovine having a genome lacking a gene 
encoding a protein having biological activity of myostatin; a transgenic mouse having a genome 
containing a gene encoding a human protein having biological activity of myostatin or containing 

20 a gene encoding a bovine protein having biological activity of myostatin; a transgenic bovine 

having a gene encoding a bovine protein having biological activity of myostatin and heterologous 
nucleotide sequence antisense to the gene. The transgenic bovine can include a gene encoding 
a nucleic acid sequence having ribozyme activity and in transcriptional association with the 
nucleotide sequence antisense to the gene. 

25 The invention includes a transgenic mammal, usually non-human, having a 

phenotype characterized by muscular hyperplasia, said phenotype being conferred by a 
transgene contained in the somatic and germ cells of the mammal, the transgene encoding a 
myostatin protein having a dominant negative mutation. The transgenic mammal can be male 
and the transgene can be located on the Y chromosome. The mammal can be bovine and the 

30 transgene can be located to be under the control of a promoter which normally a promoter of a 
myosin gene. 

Another transgenic mammal, usually non-human, of the invention has a 
phenotype characterized by muscular hyperplasia, in which the phenotype is conferred by a 
transgene having a sequence antisense to that encoding a myostatin protein of the mammal. 
35 The mammal can be a male bovine and the transgene can be located on the Y chromosome. 
The transgene can further include a sequence which when transcribed obtains an mRNA having 
ribozyme activity. 

A transgenic non-human mammal of the invention having a phenotype 
characterized by muscular hyperplasia, can have the phenotype inducible and conferred by a 



WO 99/02667 PCT/IB98/01197 



10- 



myostatin gene flanked by J oxP sides and a Cre transgene under the dependence of an 

inducible promoter. 

A transgenic non-human male mammal of the invention having a phenotype 
characterized by muscular hyperplasia, can have the phenotype conferred by a myostatin gene 
5 flanked by J oxP sides and a Cre transgene located on the Y chromosome. 

The invention includes a method for determining whether a sample of 
mammalian genetic material is capable of a conferring a phenotype characterized by muscular 
hyperplasia, comprising ascertaining whether the genetic material contains a nucleotide 
sequence encoding a protein having biological activity of myostatin, wherein the absence of said 
1 0 sequence indicates the presence of muscular hyperplasia in the animal. 

Brief Description Of Drawings 

In describing particular aspects of the invention, reference is made to the 

accompanying drawings, in which: 

Figure 1 is a schematic summary of genetic, physical and comparative mapping 
1 5 information around the bovine mh locus. A multi-point lodscore curve obtained for the mh locus 

with respect to the microsatellite marker map is shown. Markers that were not informative in the 

pedigree used are shown between brackets; their map position is inferred from published 

mapping data. Markers and the YACs from which they were isolated are connected by arrows. 

The RH-map of the relevant section of human HSA2 is shown, with the relative position in cR of 
20 the ESTs used. Stippled lines connect microsatellite and Type I markers with their respective 

positive YACs. YACs showing cross-hybridizing SINE-PCR products are connected by the red 

boxes. 

Figure 2(a) shows electropherograms obtained by cycle-sequencing the 
myostatin cDNA sequence from a double-muscled and a conventional animal, showing the 
25 nt821del(1 1) deletion (SEQ ID NO:1 1) in the double-muscled animal. The primers used to 
amplify the fragment encompassing the deletion from genomic DNA are spaced apart from the 

remaining nucleotides. 

Figure 2(b) shows the amino-acid sequence of the murine (top row), bovine 
normal (middle row) and bovine nt821del(11) (bottom row) allele. The putative site of proteolytic 
30 processing is boxed, while the nine conserved cysteines in the carboxy-terminal region are 

underlined. The differences between the normal and nt821del(11) bovine allele are indicated by 

the double underlining. 

Figure 3 is a schematic representation of the bovine myostatin gene with position 
and definition of the identified DNA sequence polymorphisms. The "A" (clear) boxes correspond 
35 to the untranslated leader and trailer sequences (large diameter), and the intronic sequences 
(small diameter) respectively. The "B", "C". and "D" boxes correspond to the sequences coding 
for the leader peptide. N-terminal latency-associated peptide and bioactive carboxyterminal 
domain of the protein respectively. Small "e", "F and "g" arrows point towards the positions of the 
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primers used for intron amplification, exon amplification and sequencing and exon sequencing 
respectively; the corresponding primer sequences are reported in Table 1 . The positions of the 
identified DNA sequence polymorphisms are shown as "h", T or "j" lines on the myostatin gene 
for silent, conservative and disrupting mutations respectively. Each mutation is connected via an 
5 arrow with a box reporting the details of the corresponding DNA sequence and eventually 
encoded peptide sequence. In each box, the variant sequence is compared with the control 
Holstein-Friesian sequence and differences are highlighted in color. 

Figure 4 shows the distribution of identified mutations in the various breeds 
examined. The order of the myostatin mutations correspond to Figure 3. All analyzed animals 
1 0 were double-muscled except for the two Holstein-Friesian and two Jerseys used as controls 
(column 1). 

Detailed Description Of Preferred Embodiments 

The method used for isolating genes which cause specific phenotypes is known 
as positional candidate cloning. It involves: (i) the chromosomal localization of the gene which 

15 causes the specific phenotype using genetic markers in a linkage analysis; and (ii) the 

identification of the gene which causes the specific phenotype amongst the "candidate" genes 
known to be located in the corresponding region. Most of the time these candidate genes are 
selected from available mapping information in humans and mice. 

The tools required to perform the initial localization (step (i) above) are 

20 microsatellite marker maps, which are available for livestock species and are found in the public 
domain (Bishop et a/., 1994; Barendse et a/., 1994; Georges et a/., 1995; and Kappes, 1997). 
The tools required for the positional candidate cloning, particularly the YAC libraries, (step (ii) 
above) are partially available from the public domain. Genomic libraries with large inserts 
constructed with Yeast Artificial Chromosomes CYAC") are available in the public domain for 

25 most livestock species including cattle. When cross-referencing the human and mice map, it is 
necessary to identify the positional candidate, which is available at low resolution but needs to be 
refined in every specific instance to obtain the appropriate level of high resolution. A number of 
original strategies are described herein to achieve this latter objective. For general principles of 
positional candidate cloning, see Collins, 1995 and Georges and Andersson, 1996. 

30 In order to allow for cross-referencing between the bovine and human gene 

map as part of the positional candidate cloning approach, HSA2q31-32 (map of the long arm of 
human chromosome 2, cytogenetic bands q31-32) and BTA2q12-22 (map of the arm of bovine 
chromosome 2, cytogenetic bands q12-22) were integrated on the basis of coincidence bovine 
YAC's as described below. 

35 Using a previously described experimental [(normal x double-muscled) x double- 

muscled] backcross population comprising 108 backcross individuals, the mh locus was recently 
mapped by linkage analysis to the centromeric tip of bovine chromosome 2 (BTA2), at 3.1 
centiMorgan proximal from the last marker on the linkage map: TGLA44 (Charlier et a/., 1995). It 



BNSDOCID: <WO 9902867A1 J_> 



WO 99/02667 PCT/IB98/01197 

-12- 

was also known from previous work that pro-o(lll) collagen (Col3AI) was located in the same 
chromosomal region as the mh locus. Col3AI has been mapped to BTA2q12-22 by in situ 
hybridization (Solinas-Toldo et a/., 1 995), while a Col3AI RFLP marker was shown to be closely 
linked to TGLA44 (6=2%)(Fisher et a/., 1996). This identifies the region flanking Col3AI on the 
5 human map, i.e. HSA2q31 -32, as the likely orthologous human chromosome segment This 
assumption is compatible with data from Zoo-FISH experiments (Solinas-Toldo etal, 1995) as 
well as mapping data of Type I markers on somatic cell hybrids (O'Brien etal, 1993), which 
establish an evolutionary correspondence between segments of HSA2q and BTA2. 

In order to refine the correspondence between the HSA2q31-33 and BTA2q12- 
1 0 22 maps. Comparative Anchored Tagged Sequences or CATS, i.e. primer pairs that would 

amplify a Sequence Tagged Site or STS from the orthologous gene in different species (Lyons et 
al., 1996), were developed for a series of genes flanking Col3A1 on the human map and for 
which sequence information was available in more than one mammal. In addition to Col3AI, 
working CATS were obtained for a2(V) collagen (Col5A2), inositol polyphosphate-1 phosphatase 
1 5 (INPP1), tissue factor pathway inhibitor precursor (TFPI), titin (TTAQ, n-chimaerin (CHAQ, 

glutamate decarboxylase 67 (GAD1). Cytotoxic T-lymphocyte-associated protein 4 (CTLA4) and 
T-cell membrane glycoprotein CD28 {CD28). The corresponding primer sequences are given in 
Table 1. 
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Table 1: 





CATS 




INPP1 


UP: 5' CAGCAAAGTCCTTAATGGTAACAAGC 3* 


DN: 5" GGGTCACTGAAGAAAACGTCCTG 3' 




COL3A1 


UP: S? CCCCATATTATGGAGATGAACCG 3 


DN: 5* AGTTCAGGATGGCAGAATTTCAG 3* 


5 


COL5A2 


UP: 5" GCAAACTGGGYGGRAGCAAGACC 3' 


DN: S TTSTTCCTGGGCTTTTATTGAGAC 3* 




TFP1 


UP: 5' AAGCCWGATTTCTGCTTYTTGGAAG 3* 


DN: 5* TGCCMAGGCAHCCRCCRTACTTGAA 3' 




TTN 


UP: S GGTCGTCCTACACCAGAAG 3* 


DN: 5' GGTTGACATTGTCAAGAACAAG 3' 




CHN 


UP: S TCTCMAAAGTCGTCTGTGACAATC 3' 


DN: 5 TGYTCRTTTTCTTTCAGAGTTGC 3* 




GAD1 


UP: 5' RCTGGTCCTCTTCACCTCAGAAC 3* 


DN: 5 ACATTGTCVGTTCCAAAGCCAAG 3* 


10 


CTLA4 


UP: 5* AGGTYCGGGTGACDGTGCTKC 3* 


DN: 5* TGGRTACATGAGYTCCACCTTGC 3' 




CD28 


UP: 5' AGCTGCARGTATWCCTACAAYCT 3' 


DN: 5* GTYCC RTTGCTC YTCTC RTTGYC 3* 




Microsatellite markers 




TGLA44 


UP.5' AACTGTATATTGAGAGCCTACCATG 3* 


DN: 5* CACACCTTAGCGACTAAACCACCA 3' 




BULGE27 


UP: 5 1 CTACCTAACAGAATGATTTTGTAAG 3* 


DN: 5' AGTGTTCTTGCCTAGAGAATCCCAG 3' 


15 


BULGE23 


UP: 5* ACATTCTCTCACCAATATGACATAC 3' 


DN: 5* TAAGTC ACC ATTACATC CTTAG AAC 3 




BM81124 


UP: 5' GCTGTAAGAATCTTCATTAAGCACT 3* 


DN: 5' CCTGATACATGCTAAGGTTAAAAAC 3" 




BULGE28 


UP: 5* AGGCATACATCTGGAGAGAAACATG 3* 


DN: 5' CAG AGGAGCCTAGCAGGCTAC CGTC 3' 




BULGE20 


UP: 5* CAGCAGGTCTGTTGAAGTGTATCAG 3* 


DN: 5' AGTGGTAGCATTCACAGGTAGCCAG 3* 




BM3627 


UP: 5" CAGTCCATGGCACCATAAAG 3' 


DN: S TCCGTTAGTACTGGCTAATTGC 3* 


20 


ILSTS026 


UP: 5' CTGAATTGGCTCCAAAGGCC 3' 


DN: 5' AAACAGAAGTCCAGGGCTGC 3* 




INRA40 


UP: 5* TCAGTCTCCAGGAGAGAAAAC 3* 


DN: 5* CTCTGCCCTGGGGATGATTG 3* 




Bovine Myostatin primers 




GDF8.19 


5' AATGTATGTTTATATTTACCTGTTCATG 3* 






GDF8.11 


5' ACAGTGTTTGTGCAAATCCTGAGAC 3* 




25 


GDF8.12 


5* CAATGCCTAAGTTGGATTCAGGTTG 3' 






GDF8.25 


5' CTTGCTGTAACCTTCCCAGGACCAG 3' 






GDF8.15 


5' TCCCATCCAAAGGCTTCAAAATC 3* 






GDF8.21 


5* ATACTCWAGGCCTAYAGCCTGTGGT 3* 





Reading from left to right and down the table, the sequences given in Table 1 are identified as 
30 SEQ ID NO:1 2 to SEQ ID NO;53 t respectively. 



These CATS were used to screen a 6-genome equivalent bovine YAC library by 
PCR using a three-dimensional pooling strategy as described by Libert et aL, 1993. The same 
YAC library was also screened with all microsatellite markers available for proximal BTA2, i.e. 
TGLA44, BM81124, BM3627. ILSTS026, INRA40 and TGLA431 (Kappes et a/., 1997). 

35 Potential overlap between the YACs obtained with this panel of STS's was 

explored on the basis of common STS content, as well as cross-hybridization between SINE- 
PCR product from individual YACs. From this analysis, three independent YAC contigs emerged 
in the region of interest: (i) contig A containing microsatellites TGLA44, BM81 124 and Type I 
marker INPP1\ (ii) contig B containing Col3AI and Col5A2\ and (iii) contig C containing 

40 microsatellite markers BM3627, ILSTS026 and INRA40, and Type I marker TFPI. 

None of the available microsatellites mapped to contig B, therefore this cluster 
of YACs could not be positioned in cattle with respect to the two other contigs. Available 
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mapping information in the human, however, allowed prediction of contig B's position between 
contigs A and C. To test this hypothesis, two new microsatellite markers were isolated from 
contig B, BULGE20 and BULGE28. BULGE20 proved to be polymorphic, allowing for 
genotyping of the experimental backcross population. 
5 In addition, to increase the informative ness of the markers available for contig A, 

two new microsatellite markers were developed from this contig: BULGE23 and BULGE27. 
BULGE23 proved to be polymorphic and was used to type the same pedigree material. 

All resulting genotypes were used to construct a linkage map using the ILINK 
program (Lathrop and Lalouel, 1984). The following most likely order and sex-averaged 
10 recombination rates between adjacent markers was obtained: [TGLA44-(0%)-BULG23H 6 .1 0 / 0 )- 
BULG20-(1 ,6%)-ILSTS026-(2.3%)-INRA40-(7,1%)-TGLA431 . The position of BULGE20 between 
TGLA44 and ILSTS026 confirmed the anticipated order of the three contigs. Figure 1 
summarizes the resulting mapping information. 

A multi point linkage analysis was undertaken using LINKMAP, to position the 
15 mh locus with respect to the new marker map. Linkage analysis was performed under a simple 
recessive model, assuming full penetrance for mh/mh individuals and zero penetrance for the 
two other genotypes. The LOD score curve shown in Figure 1 was obtained, placing the mh locus 
in the TGLA44-BULGE20 interval with an associated maximum LOD score of 26.4. Three 
backcross individuals were shown to recombine with the BULGE20 and distal markers, but not 
20 with TGLA44 and BULGE23, therefore placing the mh locus proximal from this marker. One 
individual, was shown to recombine with TGLA44 and BULGE23, but not with the more distal 
markers, therefore positioning the mh locus distal from TGLA44 and BULGE23. Given the 
relative position of these microsatellite markers with respect to INPP1 and Co!3AI as deduced 
from the integration of the human and bovine map, these results indicated that the mh gene is 
25 likely located in a chromosome segment bounded by INPP1 and Co!3AL 

Recently, McPherron et a/. (1 997) demonstrated that mice homozygous for a 
knock-out deletion of GDF-8 or myostatin, were characterized by a generalized increase in 
skeletal muscle mass. Using the published 2676bp murine myostatin cDNA sequence 
(GenBank accession number U84005), a Tentative Human Consensus (THC) cluster in the 
30 Unigene database was identified which represented three cDNA clones (221299, 300367, 
308202) and six EST (Expressed Sequence Tag) sequences (H92027, H92028, N80248, 
N95327, W07375, W24782). The corresponding THC covered 1296 bp of the human myostatin 
gene, showing an homology of 78.1% with the murine sequence when averaged over the entire 
sequence, and 91 .1% when considering only the translated parts of the human and murine 
35 genes (566bp). This THC therefore very likely corresponds to the human orthologue of the 

murine myostatin gene. Primers (S'-GGCCCAACTATGGATAT ATTIC'S 1 (SEQ ID NO:9) and 5 f - 
GGTCCTGGGAAGGTTACAGCA-3* (SEQ ID NO:10)) were thus prepared to amplify a 272 bp 
fragment from the second exon of human myostatin and used to genotype the whole-genome 
Genebridge-4 radiation hybrid panel (Walter et a/., 1994). The RHMapper program (Slonim et 
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a/., unpublished) was used to position the myostatin gene with respect to the Whitehead/MIT 
framework radiation hybrid map, placing it at position 948.7 cR of the HSA2 map with an 
associated lodscore > 3. Closer examination of the myostatin segregation vector and its 
confrontation with the vectors from all markers located in that region (Data Release 1 1 .9, May 
5 1 997) showed it to be identical to EST SGC38239 placed on the Whitehead/MIT radiation hybrid 
map (Hudson e/a/. ( 1995) at position 946.8 cR of HSA2. This places the human myostatin gene 
on the RH-map in the interval between Col3AI (EST WI16343 - 942.5 cR) and INPP1 (EST 
L08488 - 950.2 to 951 J2 cR)(Figure 1). Myostatin therefore appeared as a very strong positional 
candidate for the mh gene. 

10 To test the potential involvement of myostatin in the determinism of double- 

muscling in cattle, primer pairs were designed based on the available mouse and human 
myostatin sequence, with the objective to amplify the entire coding sequence from bovine cDNA 
using PCR (Polymerase Chain Reaction). Whenever possible, primers were therefore positioned 
in portions of the myostatin sequence showing 100% homology between mouse and human. 

1 5 Two primer pairs were identified that amplified what was predicted to represent 98.4% of the 
bovine coding sequence plus 74 bp of 3 1 untranslated sequence, in two overlapping DNA 
fragments, respectively 660 (primers GDF8.19 - GDF8.12) and 724 bp (primers GDF8.11 - 
GDF8.21) long. The expected DNA products were successfully amplified from cDNA 
generated from skeletal muscle of both a normal (homozygous +/+) (SEQ ID NO:1) and a 

20 double-muscled (homozygous mh/mh) (SEQ ID NO:3) animal, and cycle-sequenced on both 
strands. 

The nucleotide sequence corresponding to the normal allele presented 88.9% 
identity with the mouse myostatin sequence (SEQ ID NO:5) over a 1067 bp overlap, and 
contained the expected open reading frame encoding a protein (SEQ ID NO:2) showing 92.9% 

25 identity in a 354 amino-acid overlap with mouse myostatin (SEQ ID NO:6). As expected for a 
member of the TGFp superfamily, the bovine myostatin gene is characterized by a proteolytic 
processing site thought to mediate cleavage of the bioactive carboxy-terminal domain from the 
longer N-terminal fragment, and by nine cysteine residues separated by a characteristic spacing 
and suspected to be involved in intra- and inter-molecular disulfide bridges (McPherron and Lee, 

30 1996). 

The nucleotide sequence obtained from the mh allele was identical to the 
normal allele over its entire length, except for an 11 bp deletion involving nucleotides 821 to 831 
(counting from the initiation codon). This frame shifting deletion, occurring after the first cysteine 
residue of the carboxy-terminal domain, drastically disrupts the downstream amino-acid 
35 sequence and reveals a premature stop-codon after 13 amino acids, see Figure 2. The amino 
acid sequence encoded by the mutant nucleic acid sequence is identified as SEQ ID NO:4. This 
mutation disrupts the bioactive part of the molecule and is therefore very likely to be the cause of 
the recessive double-muscling phenotype. Following conventional nomenclature, this mutation 
will be referred to as nt821(de!11). 
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To further strengthen the assumption of the causality of this mutation, primer 
pairs flanking the deletion (Figure 2) were prepared and the corresponding DNA segment from 
all animals from the experimental backcross population amplified. PCR was performed in the 
presence of dCTP 32 in order to radioactively label the amplification product. Amplification 
5 products were separated on denaturing polyacrylamide gels and detected by autoradiography. A 
188 bp product would be expected for the normal allele and a 177 bp product for the 
nt821(del11) allele. Correlation between phenotype and genotype was matched for the entire 
pedigree. All ten BBCB double-muscled sires were found to be homozygous for the 
nt821(del11) mutation, all 41 F1 females were heterozygous, while 53 double-muscled offspring 
1 0 were homozygous for the mutation, the remaining 55 conventional animals were heterozygous. 

To examine the distribution of the nt821(del11) mutation in different conventional 
and double-muscled breeds, a cohort of 25 normal individuals was genotyped representing two 
dairy breeds (Hoistein-Friesian, Red-and -White) and a cohort of 52 double-muscled animals 
representing four breeds (BBCB, Asturiana, Maine-Anjou and Piemontese). The results are 
1 5 summarized in Table 2. All dairy animals were homozygous normal except for one Red-and- 
White bull shown to be heterozygous. The occurrence of a small fraction of individuals carrying 
the mutation in dairy cattle is not unexpected as the phenotype is occasionally described in this 
breed, in BBCB and Asturiana, all double-muscled animals were homozygous for the 
nt821(del1 1) deletion, pointing towards allelic homogeneity in these two breeds. Double- 
20 muscled Maine-Anjou and Piemontese animals were homozygous "normal", i.e. they did not 
show the nt821(deI11) deletion but a distinct cysteine to tyrosine substitution (C313Y) in double- 
muscled Ptedmontese animals identified by others (Kambadur et a/., 1997) was discovered. 



Table 2: 





Breed 


Phenotype 




Genotype 




25 






+/+ 


+/nt821(deh1) 


nt821(det1 1)/nt821(del11) 




Belgian Blue 


DM 






29 




Asturiana 


DM 






10 




Piemontese 


DM 


8 






30 


Maine-Anjou 


DM 


4 








Hoistein-Friesian 


Normal 


13 








Red-and-White 


Normal 


12 


1 





The entire coding sequence was also determined for the myostatin gene in 
double-muscled individuals from ten European cattle breeds and a series of mutations that 
35 disrupt myostatin function were identified. 
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The coding sequence of four control Holstein-Friesian and Jersey individuals 
was identical to the previously described wild-type allele (Grobet et a/., 1997), further indicating 
that it was the genuine myostatin coding sequence being amplified, and not a non-functional 
pseudogene. 

5 Amongst the 32 double-muscled animals, seven DNA sequence variants within 

the coding region were found, as summarized in Figure 3. 

In addition to the nt821(de!11) mutation in the third exon, described above, four 
new mutations that would be expected to disrupt the myostatin function were found. An 
insertion/deletion at position 419 counting from the initiation codon, replacing 7 base pairs with an 

10 apparently unrelated stretch of 10 base pairs, reveals a premature stop codon in the N-terminal 
latency-associated peptide at amino-acid position 140. This mutation is referred to as 
nt419(de!7-ins10). Two base pair substitutions in the second exon, a C-*T transition at 
nucleotide position 610 and a G— >T transversion at nucleotide position 676, each yield a 
premature stop codon in the same N-terminal latency-associated peptide at amino-acid positions 

15 204 and 226 respectively. These mutations are called Q204X and E226X respectively. Finally, a 
G-*A transition at nucleotide position 938 results in the substitution of a cysteine by a tyrosine. 
This mutation is referred to as C313Y. This cysteine is the fifth of nine highly conserved cysteine 
residues typical of the members of the TGF-p superfamily and shared in particular by TGF-pi , - 
P2 and -p3, and inhibin-pA and -pB (McPherron & Lee, 1996). It is thought to be involved in an 

20 intramolecular disulfide bridge stabilizing the three-dimensional conformation of the bioactive 
carboxyterminal peptide. Its substitution is therefore likely to affect the structure and function of 
the protein. This C313Y has recently also been described by Kambadur et a/. (1997). 

A conservative phenylalanine to leucine substitution was also found at amino- 
acid position 94 in the first exon, due to a C-*A transversion at nucleotide position 282 of the 

25 myostatin gene. Given the conservative nature of the amino-acid substitution, its location in the 
less conserved N-terminal latency-associated peptide, and as this mutation was observed at the 
homozygous condition in animals that were not showing any sign of exceptional muscular 
development, this mutation probably does not interfere drastically with the myostatic function of 
the encoded protein, if at all. This mutation is referred to as F94L The murine protein is 

30 characterized by a tyrosine at the corresponding amino-acid position. 

Also identified was a silent C— ►T transition at the third position of the 138th 
cytosine codon in the second exon, referred to as nt414(C-T). • 

In addition to these DNA sequence polymorphisms detected in the coding region 
of the myostatin gene, also found were four DNA sequence variants in intronic sequences which 

35 are probably neutral polymorphisms and which have been assigned the following symbols: 

nt374-51(T-C), nt374-50(G-A), nt374-16(det1) in intron 1, and nt748-78(del1) in intron 2 (Figure 
3). 

Figure 4 shows the observed distribution of mutations in the analysed sample 
sorted by breed. For the majority of the studied breeds, the analyzed double-muscled animals 
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were homozygous for one of the five described mutations expected to disrupt the myostatin 
function or compound heterozygotes for two of these mutations. This is compatible with the 
hypothesis that the double-muscled condition has a recessive mode of inheritance in all these 
breeds. 

5 Only in Limousin and Blonde d'Aquitaine was there no clear evidence for the 

role of myostatin loss-of-f unction mutations in the determinism of the observed muscular 
hypertrophy. Most Limousin animals were homozygous for the conservative F94L substitution 
which is unlikely to cause the muscular hypertrophy characterizing these animals, as discussed 
above. One Limousin animal proved to be heterozygous for this mutation, the other allele being 
1 0 the "wild-type" one. All Blonde d'Aquitaine animals were homozygous "wild-type". These data 
indicate either that the myostatin gene is possibly not involved in the double-muscled condition 
characterizing these two breeds, or that there are additional myostatin mutations outside of the 
coding region. The double-muscling condition is often considered to be less pronounced in 
Limousin animals compared to other breeds. 
15 The data indicate that some mutations, such as the nt821del(11) and C313Y, 

are shared by several breeds which points towards gene migration between the corresponding 
populations, while others seems to be confined to specific breeds. Moreover, while some breeds 
(the Belgian Blue breed in particular) seem to be essentially genetically homogeneous others 
show clear evidence for allelic heterogeneity (e.g. Maine-Anjou). 
20 The observation of allelic heterogeneity contradicts with the classical view that a 

single mh mutation spread through the European continent in the beginning of the 19th century 
with the dissemination of the Shorthorn breed from the British Isles (Menissier, 1982). Two of 
the mutations at least are shared by more than one breed, indicating some degree of gene 
migration but definitely not from a single origin. 
25 In mice, and in addition to the in vitro generated myostatin knock-out mice 

(McPherron & Lee, 1997), the compact mutation could be due to a naturally occurring mutation 
at the myostatin gene. The compact locus has been mapped to the D1Mit375-D1Mit21 interval 
on mouse chromosome 1 known to be orthologous to HSA2q31-32 and BTA2q12-22 fVarga et 
a/. t 1997). 

30 From an applied point of view, the characterisation of a panel of mutations in 

the myostatin gene associated with double-muscling contributes to the establishment of a 
diagnostic screening system allowing for marker assisted selection for or against this condition in 
cattle. 

Example 1 
35 Genetic and physical mapping 

Integration of the HSA2q31-32 and BTA2q12-22 maps was done by using 
coincident YAC's and the mh locus was positioned in the interval flanked by Col3AI and INPP1 as 
follows. Genetic mapping was performed using a previously described (Holstein-Friesian x 
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Belgian Blue) x Belgian Blue experimental backcross population counting 108 informative 
individuals (Charlier et aL, 1 995). Microsatellite genotyping was performed according to standard 
procedures (Georges et aL, 1995), using the primer sequences reported in Table 1 . Linkage 
analyses were performed with the MLINK, ILINK and LINKMAP programs of the LINKAGE 
5 (version 5.1) and FASTLINK (2.3P version, June 1995) packages (Lathrop & Lalouel, 1984; 
Cottingham et aL, 1 993). The YAC library was screened by PCR using a three dimensional 
pooling scheme as described in Libert et aL, 1993. The primer pairs corresponding to the CATS 
used to screen the library are reported in Table 1 . Cross-hybridisation between SINE-PCR 
products of individual YACs was performed according to Hunter et at. (1996), using primers 
10 reported in Lenstra et aL (1993). Microsatellites were isolated from YACs according to Cornells 
etaL (1992). 

Example 2 

Mapping of the human myostatin gene on the Genebridge-4-panel 

DNA from the Genebridge-4 panel (Walter etaL, 1994) was purchased from 
15 Research Genetics (Huntsville, Alabama), and genotyped by PCR using standard procedures 
and the following human myostatin primer pair (S'-GGCCCAACTATGGATATATTTG-S' and 5 - 
GGTCCTCGGAAGGTrACAGCA-S 1 ). Mapping was performed via the WWW server of the 
Whitehead Institute/MIT Center for Genome Research using their RH-mapper program (Slonim, 
D.; Stein, L.; Kruglyak, L.; Lander, E., unpublished) to position the markers with respect to the 
20 framework map. Segregation vectors of the query markers were compared with the vectors from 
all markers in the region of interest in the complete Data Release 1 1 .9 (May 1997) to obtain a 
more precise position. This positions myostatin in the INPP1-Col3AI on the human map with LOD 
score superior to 3. 

Example 3 
25 RT-PCR 

To clone the bovine myostatin orthologue a strategy based on RT-PCR 
amplification from skeletal muscle cDNA was chosen. Total RNA was extracted from skeletal 
muscle (Triceps brachialis) according to Chirgwin etaL (1979). RT-PCR was performed using 
the Gene-Amp RNA PCR Kit (Perkin-Elmer) and the primers reported in Table 1 . The PCR 
30 products were purified using QiaQuick PCR Purification kit (Qiagen) and sequenced using Dye 
terminator Cycle Sequencing Ready Reaction (Perkin-Elmer) and an ABI373 automatic 
sequencer, using the primers reported in Table 2. 

Example 4 

Diagnosis of the nt821(de!11) deletion 

35 To diagnose the nt821 (dell 1) the following primer sequences were designed 

flanking the nt821(de!1 1) deletion: S'-TCTAGGAGAGATTTTGGGCTT-S' (SEQ ID NO:53) and 5- 
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GATGGGTATGAGGATACTTTTGC-3 , (SEQ ID NO:52). These primers amplify a 188 bp 
fragments from normal individuals and a 177bp fragment from double-muscled individuals. 
Heterozygous individuals show the two amplification products. These amplification products can 
be detected using a variety of methods. In this example the PGR product was labelled by 

5 incorporation of dCTP 32 , separated on a denaturing acrylamide gel and revealed by 
autoradiography. Other approaches that could be used to distinguish the three different 
genotypes are known to those skilled in the art and would include separation in agarose gels and 
visualization with ethidium bromide, direct sequencing, TaqMan assays, hybridization with allele 
specific oligonucleotides, reverse dot-blot, RFLP analysis and several others. The specificity of 

1 0 the test is linked to the detected mutation and not to the primers used in the detection method. 
That means that other primers can easily be designed based on said bovine myostatin sequence 
that would fulfill the same requirements. 

Example 5 

Determination of mutations in other breeds 
15 a total of 32 animals with extreme muscular development were sampled from 

ten European beef cattle breeds in which double-muscled animals are known to occur at high to 
moderate frequency: (i) Belgium: Belgian Blue (4), (u) France: Blonde d'Aquitaine (5), Charolais 
(2), Gasconne (2), Limousin (5), Maine-Anjou (4), Parthenaise (3), fiii) Spain: Asturiana (2), 
Rubia Gallega (2), (iv) Italy: Piedmontese (2). The determination of the double-muscled 
20 phenotype of the sampled animals was performed visually by experienced observers. Four 

animals with conventional phenotype sampled from the Holstein-Friesian (2) and Jersey (2) dairy 
populations were analysed as controls. 

In order to facilitate the study of the myostatin coding sequence from genomic 
DNA, the sequences of the exon-intron boundaries of the bovine gene were determined. In mice, 
25 the myostatin gene is known to be interrupted by two introns, respectively *1 .5 and 2.4 Kb long 
(McPherron & Lee, 1997). Two primer pairs were thus designed, respectively, in bovine exons 1 
and 2, and exons 2 and 3, that were predicted to flank the two introns, assuming conservation of 
gene organisation between mouse and cattle (Figure 3 and Table 3). Using these primer sets, 
two PGR products respectively 2Kb and 3.5Kb long were generated from a YAC clone (179A3) 
30 containing the bovine myostatin gene (Grobet et a/., 1997). The PCR products were purified 
using QiaQuick PCR Purification kit (Qiagen) and partially sequenced using Dye terminator Cycle 
Sequencing Ready Reaction (Perkin-Elmer) and an ABI373 automatic sequencer. Alignment 
with the bovine cDNA sequence identified the four predicted exon-intron boundaries. The 
nucleotide sequence corresponding to bovine genomic DNA is identified as SEQ ID NO:54. 
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Table 3: Primers used for PCR amplification and cycle sequencing. 



irurun 1 -o 


O-OnnOMLiOn 1 OnU 1 MOUnL 

GCCAGGACG-y 


1 nfr/sn 4 T 

inironi-o 


S^CTAGTTTATTGTATTGTATCTT 
AGAGC-3* 


inironz-o 


TTGT-3* 


intronz-3 


5 -ATACTCWAGGCCTAYAGCCT 
GTGGT-3' 


Exonl -5* 


O -rt 1 1 l>nw I OO 1 1 ooUnnu 

TTGTCTCTCAGA-3' 


txom-o 


b -CCCTCCTCCTTACATACAAGC 
CAGCAG-3* 




c> f^TTOATAfi ATTfZATATf^f2 

AG GTGTTCG-3' 


cxonz-o 


5 -ATAAGCACAGGAAACTGGTAG 
TTATT-3* 




AATGATTAG-3* 


txon*5-3 


5 -ATACTCWAGGCCTAYAGCCT 
GTGGT-3* 


cxon iroeq i 


D - 1 1 uAuoA 1 vj I Ao 1 b 1 1 1 1 

CC-3' 


Exonl -Seq2 


5-GCCATAAAAATCCAAATCCTC 
AG-3* 


Exon2-Seq1 


S'-CATTTATAGCTGATCTTC 
TAACGCAAG-3* 


Fvnn9. Con? 


o-lbl ^CjOACjCjACj I GTTGACAG 
GCCTCAG-3' 


Exon2-Seq3 


5-GTACAAGGTATACTGGAA 
TCCGATCTC-3' 






Exon3-Seq1 


5-AGCAGGGGCCGGCTGAA 
CCTCTGGG-3' 


Exon3-Seq2 


5-CCCCAGAGGTTCAGCCGGCC 
CCTGC-3* 



Based on the available exonic and intronic sequences of the bovine myostatin 
gene, three primer pairs that jointly allow for convenient amplification of the entire coding 
sequence from genomic DNA were designed. The position of the corresponding primers is 
shown in Figure 3, and the corresponding sequences are reported in Table 3. 

15 After PCR amplification of the entire coding sequence from genomic DNA in the 

three described fragments, these were purified using QiaQuick PCR Purification kit (Qiagen) and 
sequenced using Dye terminator Cycle Sequencing Ready Reaction (Perkin-Elmer) and an 
ABI373 automatic sequencer, using the primers used for amplification as well as a series of 
nested primers (Figure 3 and Table 3). Chromat files produced with the ABI373 sequencer were 

20 analysed with the Polyphred application (D. Nickerson, personal communication), which is part of 
a series of sequence analysis programs including Phred (Ewing, B. & Green, P. (1992), 
unpublished), Phrap (Green, P. (1994), unpublished) and Consed (Gordon, D. (1995), 
unpublished), but any suitable sequencing programme would do, as known to a person skilled in 
the art. 

25 Monoclonal antibodies (Mab's) specific for myostatin are useful. In the case of 

the bovine protein having the amino acid sequence identified as SEQ ID NO:2 t for example, 
antibodies can be used for diagnostic purposes such as for determining myostatin protein levels 
in muscle tissue. To produce these antibodies, purified myostatin is prepared. The myostatin 
can be produced in bacterial cells as a fusion protein with glutathione-S-transferase using the 

30 vector pGEX2 (Pharmacia). This permits purification of the fusion protein by GSH affinity 
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chromatography. In another approach, myostatin is expressed as a fusion protein with the 
bacterial maltose binding domain. The fusion protein is thus recovered from bacterial extracts by 
passing the extract over an amylose resin column followed by elution of the fusion protein with 
maltose. For this fusion construct, the vector pMalC2, commercially available from New England 
5 Biolabs, can be used. The preparation of a second fusion protein is also useful in the preliminary 

screening of MAb's. 

The generation of hybridomas expressing monoclonal antibodies recognizing 
myostatin protein is carried out as follows: BALB/c mice are injected intraperitoneal^ with 
protein/adjuvant three times at one-month intervals, followed by a final injection into the tail vein 
1 0 shortly prior to cell fusion. Spleen cells are harvested and fused with NS-1 myeloma cells 

(American Type Culture Collection, Rockville, MD) using polyethylene glycol 4000 according to 
standard protocols (Kennett, 1979; Mirski, 1 989). The cell fusion process is carried out as 
described in more detail below. 

The fused cells are plated into 96-well plates with peritoneal exudate cells and 
15 irradiated spleen cells from BALB/Ccmice as feeder layers and selection with hypoxanthine, 
aminopterin, and thymidine (HAT medium) is performed. 

An ELISA assay is used as an initial screening procedure. 1-10 pg of purified 
myostatin (cleaved from the fusion protein) in PBS is used to coat individual wells, and 50-100 pi 
per well of hybridoma supernatants is incubated. Horseradish peroxidase-conjugated anti- 
20 mouse antibodies are used for the colorimetric assay. 

Positive hybridomas are cloned by limiting-dilution and grown to large-scale for 
freezing and antibody production. Various positive hybridomas are selected for usefulness in 
western blotting and immunohistochemistry, as well as for cross reactivity with myostatin proteins 
from different species, for example the mouse and human proteins. 
25 Alternatively, active immunization by the generation of an endogenous antibody 

by direct exposure of the host animal to small amounts of antigen can be carried out. Active 
immunization involves the injection of minute quantities of antigen (g) which probably will not 
induce a physiological response and will be degraded rapidly. Antigen will only need to be 
administered as prime and boost immunizations in much the same manner as techniques used 
30 to confer disease resistance (Pell et a/., 1997). 

Antisense nucleic acids or oligonucleotides (RNA or preferably DNA) can be 
used to inhibit myostatin production in order to increase muscle mass of an animal. Antisense 
oligonucleotides, typically 15 to 20 bases long, bind to the sense mRNA or pre mRNA region 
coding for the protein of interest, which can inhibit translation of the bound mRNA to protein. The 
35 cDNA sequence encoding myostatin can thus be used to design a series of oligonucleotides 
which together span a large portion, or even the entire cDNA sequence. These oligonucleotides 
can be tested to determine which provides the greatest inhibitory effect on the expression of the 
protein (Stewart, 1996). The most suitable mRNA target sites include 5- and 3-untranslated 
regions as well as the initiation codon. Other regions might be found to be more or less effective. 
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Alternatively, an antisense nucleic acid or oligonucleotide may bind to myostatin coding or 
regulatory sequences. 

Rather than reducing myostatin activity by inhibiting myostatin gene expression 
at the nucleic acid level, activity of the myostatin protein may be directly inhibited by binding to an 
5 agent, such as, for example, a suitable small molecule or a monoclonal antibody. 

ft will of course be understood, without the intention of being limited thereby, that 
a variety of substitutions of amino acids is possible while preserving the structure responsible for 
myostatin activity of the proteins disclosed herein. Conservative substitutions are described in the 
patent literature, as for example, in United States Patent No. 5,264,558 or 5,487,983. It is thus 

10 expected, for example, that interchange among non-polar aliphatic neutral amino acids, glycine, 
alanine, proline, valine and isoleucine, would be possible. Likewise, substitutions among the 
polar aliphatic neutral amino acids, serine, threonine, methionine, asparagine and glutamine 
could possibly be made. Substitutions among the charged acidic amino acids, aspartic acid and 
glutamic acid, could probably be made, as could substitutions among the charged basic amino 

15 acids, lysine and arginine. Substitutions among the aromatic amino acids, including 

phenylalanine, histidine, tryptophan and tyrosine would also likely be possible. These sorts of 
substitutions and interchanges are well known to those skilled in the art. Other substitutions 
might well be possible. Of course, it would also be expected that the greater the percentage of 
homology, i.e., sequence similarity, of a variant protein with a naturally occurring protein, the 

20 greater the retention of metabolic activity. Of course, as protein variants having the activity of 
myostatin as described herein are intended to be within the scope of this invention, so are 
nucleic acids encoding such variants. 

A further advantage may be obtained through chimeric forms of the protein, as 
known in the art. A DNA sequence encoding the entire protein, or a portion of the protein, could 

25 thus be linked, for example, with a sequence coding for the C-terminal portion of E. coli B- 

gaiactosidase to produce a fusion protein. An expression system for human respiratory syncytial 
virus glycoproteins F and G is described in United States Patent No. 5,288,630 issued February 
22, 1994 and references cited therein, for example. 

A recombinant expression vector of the invention can be a plasmid, as described 

30 above. The recombinant expression vector of the invention further can be a virus, or portion 
thereof, which allows for expression of a nucleic acid introduced into the viral nucleic acid. For 
example, replication defective retroviruses, adenoviruses and adeno-associated viruses can be 
used. 

The recombinant expression vectors of the invention can be used to make a 
35 transformant host cell including the recombinant expression vector. The term "transformant host 
cell" is intended to include prokaryotic and eukaryotic cells which have been transformed or 
transfected with a recombinant expression vector of the invention. The terms "transformed with", 
"transfected with", "transformation" and "transfection" are intended to encompass introduction of 
nucleic acid (e.g. a vector) into a cell by one of many possible techniques known in the art. 
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Prokaryotic cells can be transformed with nucleic acid by, for example, electroporation or 
calcium-chloride mediated transformation. Nucleic acid can be introduced into mammalian cells 
via conventional techniques such as calcium phosphate or calcium chloride coprecipitation, 
DEAE-dextran-mediated transfection, lipofection, electroporation or microinjection. Suitable 
5 methods for transforming and transfecting host cells are known (Sambrook, 1 989). 

The number of host cells transformed with a recombinant expression vector of 
the invention by techniques such as those described above will depend upon the type of 
recombinant expression vector used and the type of transformation technique used. Plasmid 
vectors introduced into mammalian cells are integrated into host cell DNA at only a low 
1 0 frequency. In order to identify these integrants, a gene that contains a selectable marker (e.g. 
resistance to antibiotics) is generally introduced into the host ceils along with the gene of interest. 
Preferred selectable markers include those which confer resistance to certain drugs, such as 
G418 and hygromycin. Selectable markers can be introduced on a separate plasmid from the 
nucleic acid of interest or, preferably, are introduced on the same plasmid. Host cells 
1 5 transformed with one or more recombinant expression vectors containing a nucleic acid of the 
invention and a gene for a selectable marker can be identified by selecting for cells using the 
selectable marker. For example, if the selectable marker encodes a gene conferring neomycin 
resistance (such as pRc/CMV), transformant cells can be selected with G41 8. Ceils that have 
incorporated the selectable marker gene will survive, while the other cells die. 
20 Nucleic acids which encode myostatin proteins can be used to generate 

transgenic animals. A transgenic animal (e.g., a mouse) is an animal having cells that contain a 
transgene, which transgene is introduced into the animal or an ancestor of the animal at a 
prenatal, e.g., an embryonic stage. A transgene is a DNA which is integrated into the genome of 
a cell from which a transgenic animal develops. In one embodiment, a bovine cDNA, comprising 
25 the nucleotide sequence shown in SEQ ID NO:1 , or an appropriate variant or subsequence 
thereof, can be used to generate transgenic animals that contain cells which express bovine 
myostatin. Likewise, variants such as mutant genes (e.g. SEQ ID NO:3) can be used to generate 
transgenic animals. This could equally well be done with the human myostatin protein and 
variants thereof. "Knock out" animals, as described above, can also be generated. Methods for 
30 generating transgenic animals, particularly animals such as mice, have become conventional in 
the art are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009. In a preferred 
embodiment, plasmids containing recombinant molecules of the invention are microinjected into 
mouse embryos. In particular, the plasmids are microinjected into the male pronuclei of fertilized 
one-cell mouse eggs; the injected eggs are transferred to pseudo-pregnant foster females; and, 
35 the eggs in the foster females are allowed to develop to term. (Hogan, 1986). Alternatively, an 
embryonal stem cell line can be transfected with an expression vector comprising nucleic acid 
encoding a myostatin protein, and cells containing the nucleic acid can be used to form 
aggregation chimeras with embryos from a suitable recipient mouse strain. The chimeric 
embryos can then be implanted into a suitable pseudopregnant female mouse of the appropriate 
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strain and the embryo brought to term. Progeny harboring the transfected DNA in their germ 
cells can be used to breed uniformly transgenic mice. 

Such animals could be used to determine whether a sequence related to an 
intact myostatin gene retains biological activity of myostatin. Thus, for example, mice in which 
5 the murine myostatin gene has been knocked out and containing the nucleic acid sequence 
identified as SEQ ID NO:1 could be generated along with animals containing the nucleic acid 
sequence identified as SEQ ID NO:3. The animals could be examined for display of muscular 
hyperplasia, especially in comparison with knockout mice, which are known to display such. In 
this way it can be shown that the protein encoded by SEQ ID NO:3 lacks myostatin activity within 

1 0 the context of this invention while the protein encoded by the nucleic acid sequence identified as 
SEQ ID NO:1 possesses biological activity of myostatin. 

In such experiments, muscle cells would be particularly targeted for myostatin 
(and variants) transgene incorporation by use of tissue specific enhancers operatively linked to 
the encoding gene. For example, promoters and/or enhancers which direct expression of a 

15 gene to which they are operatively linked preferentially in muscle cells can be used to create a 
transgenic animal which expresses a myostatin protein preferentially in muscle tissue. 
Transgenic animals that include a copy of a myostatin transgene introduced into the germ line of 
the animal at an embryonic stage can also be used to examine the effect of increased myostatin 
expression in various tissues. 

20 The pattern and extent of expression of a recombinant molecule of the invention 

in a transgenic mouse is facilitated by fusing a reporter gene to the recombinant molecule such 
that both genes are co-transcribed to form a polycistronic mRNA. The reporter gene can be 
introduced into the recombinant molecule using conventional methods such as those described 
in Sambrook et a/., (Sambrook, 1989). Efficient expression of both cistrons of the polycistronic 

25 mRNA encoding the protein of the invention and the reporter protein can be achieved by 

inclusion of a known internal translational initiation sequence such as that present in poliovirus 
mRNA. The reporter gene should be under the control of the regulatory sequence of the 
recombinant molecule of the invention and the pattern and extent of expression of the gene 
encoding a protein of the invention can accordingly be determined by assaying for the phenotype 

30 of the reporter gene. Preferably the reporter gene codes for a phenotype not displayed by the 
host cell and the phenotype can be assayed quantitatively. Examples of suitable reporter genes 
include lacZ (p-galactosidase), neo (neomycin phosphotransferase), CAT (chloramphenicol 
acetyltransferase) dhfr (dihydrofolate reductase), aphlV (hygromycin phosphotransferase), lux 
(luciferase), uidA (p-glucuronidase). Preferably, the reporter gene is lacZ which codes for p- 

35 galactosidase. p-galactosidase can be assayed using the lactose analogue X-gal (5-bromo-4- 
chloro-3-indolyl-b-D-galactopyranoside) which is broken down by p-galactosidase to a product 
that is blue in color (Old). 

The present invention includes knocking out wild type myostatin in mammals, in 
order to obtain the desired effect(s) thereof. This is particularly true in cattle raised for beef 
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production. It may well prove advantageous to substitute a defective gene (e.g. SEQ ID NO:3 or 
it genomic analogue) rather than delete the entire sequence of DNA encoding for a protein 
having myostatin activity. A method of producing a transgenic bovine or transgenic bovine 
embryo is described in United States Patent No. 5,633,076. issued May 27, 1997, for example. 
5 The transgenic animals of the invention can be used to investigate the molecular 

basis of myostatin action. For example, it is expected that myostatin mutants in which one or 
more of the conserved cysteine residues has been deleted would have diminished activity in 
relation to a wild type myostatin protein in which all such residues are retained. Further, deletion 
of proteolytic cleavage site would likely result in a mutant lacking biological activity of myostatin. 
10 Transgenesis can be used to inactivate myostatin activity. This could be 

achieved using either conventional transgenesis, i.e. by injection in fertilized oocytes, or by gene 
targeting methods using totipotent cell lines such as ES (embryonic stem cells) which can then 
be injected in oocytes and participate in the development of the resulting organisms or whose 
nucleus can be transferred into unfertilized oocytes, nucleus transfer or cloning. 
15 it is also possible to create a genetically altered animal in which the double- 

muscling trait is dominant so that the animal would be more useful in cross-breeding. Further, in 
a particular aspect, the dominant trait would be male specific. In this way, bulls would be double- 
muscled but cows would not be. In addition, or alternatively, the trait would also be unexpressed 
until after birth or inducible. If inducible the trait could be induced after birth to avoid the calving 
20 difficulties described above. 

There are at least three approaches that can be taken to create a dominant 
u mh n allele. Because functional myostatin, a member of the TGF-B superfamily, is a dimer, 
dominant negative myostatin mutations can be created (Herskowitz et a/., 1987; Lopez et a/., 
1 992). According to one method, this is accomplished by mutating the proteolytic processing 
25 site of myostatin. To enhance the dominant negative effect, the gene can be put under the 
control of a stronger promoter such as the CMV promoter or that of a myosin gene, which is 
tissue specific, i.e., expressed only in skeletal muscle. Alternatively, an antisense sequence of 
that encoding myostatin could be incorporated into the DNA, so that complementary mRNA 
molecules are generated, as understood by a person skilled in the art. Optionally, a ribozyme 
30 could be added to enhance mRNA breakdown. In another approach, ere recombinase 

generate/ribozyme approach or the Cre-lox P system could be used (Hoess et al , 1 982; Gu et 
a/., 1994). 

Male specificity can be achieved by placing the dominant mh alleles on the Y 
chromosome by homologous recombination. 
35 Inducibility can be achieved by choosing promoters with post-natal expression in 

skeletal muscle or using inducible systems such at he Tet-On and Tet-Off systems could be 
used (Gossen et a/., 1992; Shockett etaL, 1996). 

Using conventional transgenesis a gene coding for a myostatin antisense is 
injected, for example, by inverting the orientation of the myostain gene in front of its natural 
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promoter and enhancer sequences. This is followed by injection of a gene coding for an anti- 
myostain ribozyme, i.e. an RNA that would specifically bind to endogenous myostain mRNA and 
destroy it via its "ribozyme" activity. 

Also, through gene targeting, a conventional knock-out animal can be 
5 generated, specific mutations by gene replacement can be engineered. It is possible to 

inactivate the myostain gene at a specific developmental time, such as after birth to avoid calving 
difficulties. As mentioned above, this could be achieved using the Cre-lox P systems in which 
1 .ox P sides are engineered around the myostain gene by homologous recombination (gene 
targeting), and mating these animals with transgenic animals having a Cre transgene (coding for 

10 the Cre recombinase existing DNA flanked by J oxP sides) under the dependence of a skeletal 
muscle specific promoter only active after birth. This is done to obtain individuals that would 
inactivate their myostain gene after birth. As mentioned above, there are also gene targeting 
systems that allow genes to be turned on and off by feeding an animal with, for example, an 
antibiotic. In such an instance, one engineers an operator between the promoter of the gene and 

1 5 the gene itself. This operator is the target of a repressor which when binding inactivates the gene 
(for example, the lac operon in E. cofi). The repressor is brought into the cell using conventional 
transgenesis, for example, by injection of the gene coding for the repressor. 

Transgenic animals of the invention can also be used to test substances for the 
ability to prevent, slow or enhance myostatin action. A transgenic animal can be treated with the 

20 substance in parallel with an untreated control transgenic animal. 

The antisense nucleic acids and oligonucleotides of the invention are useful for 
inhibiting expression of nucleic acids (e.g. mRNAs) encoding proteins having myostatin activity. 

The isolated nucleic acids and antisense nucleic acids of the invention can be 
used to construct recombinant expression vectors as described previously. These recombinant 

25 expression vectors are then useful for making transformant host cells containing the recombinant 
expression vectors, for expressing protein encoded by the nucleic acids of the invention, and for 
isolating proteins of the invention as described previously. The isolated nucleic acids and 
antisense nucleic acids of the invention can also be used to construct transgenic and knockout 
animals as described previously. 

30 The isolated proteins of the invention are useful for making antibodies reactive 

against proteins having myostatin activity, as described previously. Alternatively, the antibodies of 
the invention can be used to isolate a protein of the invention by standard immunoaffinity 
techniques. Furthermore, the antibodies of the invention, including bispecific antibodies are 
useful for diagnostic purposes. 

35 Molecules which bind to a protein comprising an amino acid sequence shown in 

SEQ ID NO:2 can also be used in a method for killing a cell which expresses the protein, wherein 
the cell takes up the molecule, if for some reason this were desirable. Destruction of such cells 
can be accomplished by labeling the molecule with a substance having toxic or therapeutic 
activity. The term "substance having toxic or therapeutic activity" as used herein is intended to 
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include molecules whose action can destroy a cell, such as a radioactive isotope, a toxin (e.g. 
diphtheria toxin or ricin), or a chemotherapeutic drug, as well as cells whose action can destroy a 
cell, such as a cytotoxic cell. The molecule binding to the myostatin can be directly coupled to a 
substance having a toxic or therapeutic activity or may be indirectly linked to the substance. In 
5 one example, the toxicity of the molecule taken up by the cell is activated by myostatin protein. 

The invention also provides a diagnostic kit for identifying cells comprising a 
molecule which binds to a protein comprising an amino acid sequence shown in SEQ ID N02, 
for example, for incubation with a sample of tumor cells; means for detecting the molecule 
bound to the protein, unreacted protein or unbound molecule; means for determining the amount 
1 0 of protein in the sample; and means for comparing the amount of protein in the sample with a 
standard. Preferably, the molecule is a monoclonal antibody. In some embodiments of the 
invention, the detectability of the molecule which binds to myostatin is activated by said binding 
(e.g., change in fluorescence spectrum, loss of radioisotopic label). The diagnostic kit can also 
contain an instruction manual for use of the kit. 
1 5 The invention further provides a diagnostic kit for identifying cells comprising a 

nucleotide probe complementary to the sequence, or an oligonucleotide fragment thereof, 
shown in SEQ ID NO:1 , for example, for hybridization with mRNA from a sample of cells, e.g.. 
muscle cells; means for detecting the nucleotide probe bound to mRNA in the sample with a 
standard. In a particular aspect, the invention is a probe having a nucleic acid molecule 
20 sufficiently complementary with a sequence identified as SEQ ID NO:1 , or its complement, so as 
to bind thereto under stringent conditions. "Stringent hybridization conditions" takes on its 
common meaning to a person skilled in the art here. Appropriate stringency conditions which 
promote nucleic acid hybridization, for example, 6x sodium chloride/sodium citrate (SSC) at 
about 45 e C are known to those skilled in the art. The following examples are found in Current 
25 Protocols in Molecular Biology, John Wiley & Sons. NY (1 989). 6.3.1 -6.3.6: For 50 ml of a first 
suitable hybridization solution, mix together 24 ml formamide. 12 ml 20x SSC. 0.5 ml 2 M 
Tris-HCI pH 7.6. 0.5 ml 1 0Ox Denhardt's solution. 2.5 ml deionized H 2 0. 1 0 ml 50% dextran 
sulfate, and 0.5 ml 10% SDS. A second suitable hybridization solution can be 1% crystalline 
BSA (fraction V), 1 mM EDTA, 0.5 M Na 2 HPO< pH 7.2, 7% SDS. The salt concentration in the 
30 wash step can be selected from a low stringency of about 2x SSC at 50 B C to a high stringency of 
about 0.2x SSC at 50°C. Both of these wash solutions may contain 0.1 % SDS. In addition, the 
temperature in the wash step can be increased from low stringency conditions at room 
temperature, about 22"C, to high stringency conditions, at about 65°C. The cited reference 
gives more detail, but appropriate wash stringency depends on degree of homology and length 
35 of probe. If homology is 100%, a high temperature (65°C to 75°C) may be used. If homology is 
low, lower wash temperatures must be used. However, if the probe is very short (<1 OObp), lower 
temperatures must be used even with 100% homology. In general, one starts washing at low 
temperatures (37°C to 40°C), and raises the temperature by 3-5°C intervals until background is 
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low enough not to be a major factor in autoradiography. The diagnostic kit can also contain an 
instruction manual for use of the kit. 

The invention provides a diagnostic kit which can be used to determine the 
genotype of mammalian genetic material, for example. One kit includes a set of primers used 
5 for amplifying the genetic material. A kit can contain a primer including a nucleotide sequence 
for amplifying a region of the genetic material containing one of the naturally occurring mutations 
described herein. Such a kit could also include a primer for amplifying the corresponding region 
of the normal gene that produces functional myostatic Usually, such a kit would also include 
another primer upstream or downstream of the region of interest complementary to a coding 

1 0 and/or non-coding portion of the gene. A particular kit includes a primer selected from a non- 
coding sequence of a myostatin gene. Examples of such primers are provided in Table 3, 
designated as Exon1-5\ Exon1-3', Exon2-5\ Exon3-5' and Exon3-3'. These primers are used to 
amplify the segment containing the mutation of interest. The actual genotyping is carried out 
using primers that target specific mutations described herein and that could function as allele- 

15 specific oligonucleotides in conventional hybridization, Taqman assays, OLE assays, etc. 
Alternatively, primers can be designed to permit genotyping by microsequencing. 

One kit of primers thus includes first, second and third primers, (a), (b) and (c), 
respectively. Primer (a) is based on a region containing a myostatin mutation, for example a 
region of the myostatin gene spanning the nt821del(11) deletion. Primer (b) encodes a region 

20 upstream or downstream of the region to be amplified by primer (a) so that genetic material 
containing the mutation is amplified, by PCR, for example, in the presence of the two primers. 
Primer (c) is based on the region corresponding to that on which primer (a) is based, but lacking 
the mutation. Thus, genetic material containing the non-mutated region will be amplified in the 
presence of primers (b) and (c). Genetic material homozygous for the wild type gene will thus 

25 provide amplified products in the presence of primers (b) and (c). Genetic material homozygous 
for the mutated gene will thus provide amplified products in the presence of primers (a) and (b). 
Heterozygous genetic material will provide amplified products in both cases. 

The invention provides purified proteins having biological activity of myostatin. 
The terms "isolated" and "purified" each refer to a protein substantially free of cellular material or 

30 culture medium when produced by recombinant DNA techniques, or chemical precursors or 
other chemicals when chemically synthesized. In certain preferred embodiments, the protein 
having biological activity of myostatin comprises an amino acid sequence identified as SEQ ID 
NO:2. Furthermore, proteins having biological activity of myostatin that are encoded by nucleic 
acids which hybridize under stringent conditions, as discussed above, to a nucleic acid 

35 comprising a nucleotide sequence identified as SEQ ID NO:1 or SEQ ID NO:7 are encompassed 
by the invention. Proteins of the invention having myostatin activity can be obtained by expression 
in a suitable host cell using techniques known in the art. Suitable host cells include prokaryotic 
or eukaryotic organisms or cell lines, for example, yeast, E. co//, insect cells and COS 1 cells. 
The recombinant expression vectors of the invention, described above, can be used to express a 
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protein having myostatinl activity in a host cell in order to isolate the protein. The invention 
provides a method of preparing an purified protein of the invention comprising introducing into a 
host cell a recombinant nucleic acid encoding the protein, allowing the protein to be expressed in 
the host cell and isolating and purifying the protein. Preferably, the recombinant nucleic acid is a 
5 recombinant expression vector. Proteins can be isolated from a host cell expressing the protein 
and purified according to standard procedures of the art. including ammonium sulfate 
precipitation, column chromatography (e.g. ion exchange, gel filtration, affinity chromatography, 
etc.). electrophoresis, and ultimately, crystallization (see generally. "Enzyme Purification and 
Related Techniques", Methods in Enzymology, 22, 233-577 (1971)). 
1 0 Alternatively, the protein or parts thereof can be prepared by chemical synthesis 

using techniques well known in the chemistry of proteins such as solid phase synthesis 
(Merrifield, 1964). or synthesis in homogeneous solution (Houbenwcyl. 1987). 

The protein of the invention, or portions thereof, can be used to prepare 
antibodies specific for the proteins. Antibodies can be prepared which bind to a distinct epitope in 
1 5 an unconsented region of a particular protein. An unconserved region of the protein is one which 
does not have substantial sequence homology to other proteins, for example other members of 
the myostatin family or other members of the TGFB superfamily. Conventional methods can be 
used to prepare the antibodies. For example, by using a peptide of a myostatin protein, 
polyclonal antisera or monoclonal antibodies can be made using standard methods. A mammal. 
20 (e.g. a mouse, hamster, or rabbit) can be immunized with an immunogenic form of the peptide 
which elicits an antibody response in the mammal. Techniques for conferring immunogenic^ on 
a peptide include conjugation to carriers or other techniques well known in the art. For example, 
the peptide can be administered in the presence of adjuvant. The progress of immunization can 
be monitored by detection of antibody titers in plasma or serum. Standard ELISA or other 
25 immunoassay can be used to assess the levels of antibodies. Following immunization, antisera 
can be obtained and. if desired, polyclonal antibodies isolated from the sera. 

To produce monoclonal antibodies, antibody producing cells (lymphocytes) can 
be harvested from an immunized animal and fused with myeloma cells by standard somatic cell 
fusion procedures, thus immortalizing these cells and yielding hybridoma cells. Such techniques 
30 are well known in the art. For example, the hybridoma technique originally developed by Kohler 
and Milstein (Kohler. 1975) as well as other techniques such as the human B-cell hybridoma 
technique (Kozbor. 1983). the EBV-hybridoma technique to produce human monoclonal 
antibodies (Cole, 1985), and screening of combinatorial antibody libraries (Huse, 1989). 
Hybridoma cells can be screened immunochemically for production of antibodies specifically 
35 reactive with the peptide, and monoclonal antibodies isolated. 

The term antibody as used herein is intended to include fragments thereof which 
are also specifically reactive with a protein having the biological activity of myostatin. or a peptide 
fragment thereof. Antibodies can be fragmented using conventional techniques and the 
fragments screened for utility in the same manner as described above for whole antibodies. For 
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example, F(ab*) 2 fragments can be generated by treating antibody with pepsin. The resulting 
F(ab\ fragment can be treated to reduce disulfide bridges to produce Fab* fragments. 

It is also known in the art to make chimeric antibody molecules with human 
constant regions. See, for example, Morrison et a/., Takeda et a/., Cabilly et a/., Boss et a/., 
5 Tanaguchi eta!., Teng etai (Morrison, 1985; Takeda, 1985; Cabilly; Boss; Tanaguchi; Teng, 
1982), European Patent Publication 0173494, United Kingdom Patent GB 2177096B, PCT 
Publication WO92/06193 and EP 0239400. It is expected that such chimeric antibodies would be 
less immunogenic in a human subject than the corresponding non-chimehc antibody. 

Another method of generating specific antibodies, or antibody fragments, 

10 reactive against protein having the biological activity of a myostatin protein, or a peptide fragment 
thereof, is to screen expression libraries encoding immunoglobulin genes, or portions thereof, 
expressed in bacteria, with peptides produced from the nucleic acid molecules of the present 
invention. For example, complete Fab fragments, VH regions and FV regions can be expressed 
in bacteria using phage expression libraries. See for example Ward etai, Huse et a/., and 

15 McCafferty etai (Ward, 1989; Huse, 1989; McCafferty, 1990). Screening such libraries with, for 
example, a myostatin protein can identify immunoglobulin fragments reactive with myostatin. 
Alternatively, the SCID-hu mouse developed by Genpharm can be used to produce antibodies, 
or fragments thereof. 

The polyclonal, monoclonal or chimeric monoclonal antibodies can be used to 

20 detect the proteins of the invention, portions thereof or closely related isoforms in various 
biological materials, for example they can be used in an ELISA, radioimmunoassay or 
histochemical tests. Thus, the antibodies can be used to quantify the amount of a myostatin 
protein of the invention, portions thereof or closely related isoforms in a sample in order to 
determine the role of myostatin proteins in particular cellular events or pathological states. Using 

25 methods described hereinbefore, polyclonal, monoclonal antibodies, or chimeric monoclonal 
antibodies can be raised to nonconserved regions of myostatin and used to distinguish a 
particular myostatin from other proteins. 

The polyclonal or monoclonal antibodies can be coupled to a detectable 
substance or reporter system. The term "coupled" is used to mean that the detectable 

30 substance is physically linked to the antibody. Suitable detectable substances include various 
enzymes, prosthetic groups, fluorescent materials, luminescent materials and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, 
P-galactosidase, and acetylcholinesterase; examples of suitable prosthetic group complexes 
include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 

35 umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 

fluorescein, dansyl chloride and phycoerythrin; an example of a luminescent material includes 
luminol; and examples of suitable radioactive material include 125 l; 131 l, 35 S and 3 H. in a preferred 
embodiment, the reporter system allows quantitation of the amount of protein (antigen) present. 
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Such an antibody-linked reporter system could be used in a method for 
determining whether a fluid or tissue sample of a subject contains a deficient amount or an 
excessive amount of the protein. Given a normal threshold concentration of such a protein for a 
given type of subject, test kits could thus be developed. 
5 The present invention allows the skilled artisan to prepare bispecific antibodies 

and tetrameric antibody complexes. Bispecific antibodies can be prepared by forming hybrid 
hybridomas (Staerz, 1 986a &b). 

Compositions of the invention are administered to subjects in a biologically 
compatible form suitable for pharmaceutical administration in vivo. By "biologically compatible 
1 0 from suitable for administration in vivo" is meant a form of the composition to be administered in 
which any toxic effects are outweighed by the therapeutic effects of the composition. The term 
"subject" is intended to include living organisms in which a desired therapeutic response can be 
elicited, e.g. mammals. Examples of subjects include cattle, human, dogs, cats, mice, rats and 
transgenic species thereof. Administration of a therapeutically active amount of the therapeutic 
1 5 compositions of the present invention is defined as an amount effective, at dosages and for 
periods of time necessary to achieve the desired result. For example, a therapeutically active 
amount of a compound that inhibits the biological activity of myostatin protein may vary according 
to factors such as the age, sex, and weight of the individual, as well as target tissue and mode of 
delivery. Dosage regimes may be adjusted to provide the optimum therapeutic response. For 
20 example, several divided doses may be administered daily or the dose may be proportionally 
reduced as indicated by the exigencies of the therapeutic situation. 

As far as the United States is concerned, this application is a Continuation-in- 
Part Application of prior United States Patent Application Serial No. 08/891,789, filed July 14, 
1997, the specification of which is incorporated herein by reference. 
25 Those skilled in the art will know, or be able to ascertain using no more 

than routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following claims. 
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CLAIMS 

1 . A method of increasing muscle mass of a mammal having muscle cells in which myostatin is 
expressed, the method comprising administering to the mammal an effective amount of a nucleic 
acid molecule substantially complementary to at least a portion of mRNA encoding the myostatin 

5 and being of sufficient length to sufficiently reduce expression of the myostatin to increase the 
muscle mass. 

2. The method of claim 1 wherein the mammal is bovine. 

3. A method of increasing muscle mass of a mammal, the method comprising administering to 
the mammal an effective amount of a nucleic acid molecule having ribozyme activity and a 

1 0 nucleotide sequence substantially complementary to at least a portion of mRNA encoding 
myostatin and being of sufficient length to bind selectively thereto to sufficiently reduce 
expression of the myostatin so as to increase the muscle mass. 

4. The method of claim 3 wherein the mammal is bovine. 

5. A diagnostic kit, for determining the presence of muscular hyperplasia in a mammal from 
15 which a sample containing DNA of the mammal has been obtained, the kit comprising: 

first and second primers for amplifying the DNA, the primers being complementary to 
nucleotide sequences of the DNA upstream and down stream, respectively, of a 
mutation in the portion of the DNA encoding myostatin which results in muscular 
hyperplasia of the mammal, wherein at least one of the nucleotide sequences is 
20 selected to be from a non-coding region of the myostatin gene. 

6. The diagnostic kit of claim 5, further comprising a third primer complementary to a naturally 
occurring mutation of a coding portion of the myostatin gene. 

7. A diagnostic kit, for determining the genotype of a sample of mammalian genetic material, the 
kit comprising: 

25 a pair of primers for amplifying a portion of the genetic material corresponding to a 

nucleotide sequence which encodes at least a portion of a myostatin protein, 
wherein a first of the primers includes a nucleotide sequence sufficiently 
complementary to a mutation of SEQ ID NO:1 to prime amplification of a nucleic 
acid molecule containing the mutation, the mutation being selected from the group 

30 of mutations resulting from: (a) deletion of 1 1 nucleotides beginning at nucleotide 

821 of the coding portion of SEQ ID NO:1; (b) deletion of 7 nucleotides beginning at 
nucleotide 419 of the coding sequence and insertion of the sequence 
AAGCATACAA in place thereof; (c) deletion of nucleotide 204 of the coding 
sequence and insertion of T in place thereof; (d) deletion of nucleotide 226 of the 

35 coding sequence and insertion of T in place thereof; and (e) deletion of nucleotide 

313 of the coding sequence and insertion of A in place thereof; and combinations 
thereof. 
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8. The diagnostic kit of claim 7 wherein a second of the pair of primers is located entirely 
upstream or entirety downstream of the selected mutation or mutations. 

9. The diagnostic kit of claim 8 wherein a first said primer spans mutation (a) and further 
comprising a third primer which is sufficiently complementary to the nucleotide sequence 

5 identified as SEQ ID NO:1 1 to prime amplification of a nucleic acid molecule containing SEQ ID 
NO:i1. 

1 0. The diagnostic kit of claim 8 wherein a first said primer is sufficiently complementary to the 
inserted sequence of mutation (b) to prime amplification of a nucleic acid molecule containing 
mutation (b) and further comprising a third primer which is sufficiently complementary to the 

10 sequence corresponding to the 7 nucleotide deletion of mutation (b) to prime amplification of a 
nucleic acid molecule containing the 7 nucleotide deletion of mutation (b). 

11 . The diagnostic kit of claim 8 wherein a first said primer spans mutation (c) and further 
comprising a third primer which is sufficiently complementary to the sequence spanning the 
corresponding region lacking mutation (c) to prime amplification of a nucleic acid molecule 

15 lacking mutation (c). 

12. The diagnostic kit of claim 8 wherein a first said primer spans mutation (d) and further 
comprising a third primer which is sufficiently complementary to the sequence spanning the 
corresponding region lacking mutation (d) to prime amplification of a nucleic acid molecule 
lacking mutation (d). 

20 1 3. The diagnostic kit of claim 8 wherein a first said primer spans mutation (e) and further 
comprising a third primer which is sufficiently complementary to the sequence spanning the 
corresponding region lacking mutation (e) to prime amplification of a nucleic acid molecule 
lacking mutation (e). 

14. A method for determining the presence of muscular hyperplasia in a bovine animal, the 

25 method comprising: 

obtaining a sample of material containing DNA from a said animal; and 
ascertaining whether DNA having a nucleotide sequence encoding a protein having 
biological activity of myostatin is present, 
wherein the absence of DNA having said nucleotide sequence indicates the presence of 
30 muscular hyperplasia in the animal. 

15. The method of claim 14 wherein ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin includes amplifying the DNA in the 
presence of primers based on a nucleotide sequence encoding a protein having biological activity 
of myostatin. 

35 16. The method of claim 15 wherein DNA of a said bovine animal not displaying muscular 
hyperplasia has a nucleotide sequence which is capable of hybridizing with a nucleic acid 
molecule having the sequence identified as SEQ ID NO:1 under stringent hybridization 
conditions. 
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17. The method of claim 14, wherein ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the DNA 
in the presence of primers based on a nucleotide sequence encoding the N-terminal and the C- 
terminal, respectively, of the protein having biological activity of myostatin. 
5 18. The method of claim 14, wherein ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the DNA 
in the presence of first and second primers based on first and second nucleotide sequences 
encoding spaced apart regions of the protein, wherein said regions flank a mutation known to 
naturally occur and which when present in both alleles of a said animal results in said muscular 
10 hyperplasia. 

19. The method of claim 18 wherein a DNA of said animal not displaying muscular hyperplasia 
contains a nucleotide sequence which hybridizes under stringent conditions with a nucleotide 
sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the coding 
sequence of DNA of a said animal displaying muscular hyperplasia is known to contain an 1 1- 

1 5 base pair deletion beginning at base pair no. 821 , and said first primer is selected to be upstream 
of the codon encoding glutamic acid no. 275 and the second primer is selected to be 
downstream of the codon encoding aspartic acid no. 274. 

20. The method of claim 14 wherein a DNA of said animal not displaying muscular hyperplasia 
contains a nucleotide sequence which hybridizes under stringent conditions with a nucleotide 

20 sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the coding 
sequence of DNA of a said animal displaying muscular hyperplasia is known to contain an 1 1- 
base pair deletion beginning at base pair no. 821 , and said primer is selected to span the 
nucleotide sequence including base pair nos. 820 and 821 of the DNA sequence containing said 
deletion. 

25 21 . The method of claim 19 wherein the animal is of a breed selected from Belgian Blue, 
Asturiana, Parthenaise and Rubia Gallega. 

22. The method of claim 20 wherein the animal is a breed selected from Belgian Blue, 
Asturiana, Parthenaise and Rubia Gallega. 

23. The method of claim 14 wherein ascertaining whether DNA having a nucleotide sequence 
30 encoding a protein having biological activity of myostatin is present includes amplifying the DNA 

in the presence of a primer containing at least a portion of a mutation known to naturally occur 
and which when present in both alleles of a said animal results in said muscular hyperplasia. 

24. A method for determining the presence of muscular hyperplasia in a bovine animal, the 
method comprising: 

35 obtaining a sample of material containing DNA from a said animal; and 

ascertaining whether DNA having a mutation as defined in claim 7 is present; and 
ascertaining whether DNA having a nucleotide sequence encoding a protein having 
biological activity of myostatin is present, 
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wherein the absence of DNA having said nucleotide sequence and presence of a said mutation 
indicates the presence of muscular hyperplasia in the animal. 

25. A method for determining the presence of muscular hyperplasia in a bovine animal, the 
method comprising: 
5 obtaining a sample of the animal containing mRNA; and 

ascertaining whether an mRNA encoding a protein having biological activity of myostatin 
is present in the sample, 
wherein the absence of said mRNA indicates the presence of muscular hyperplasia in the 
animal. 

1 0 26. The method of claim 25 wherein the sample is of muscle tissue or wherein the tissue is 
skeletal muscle tissue. 

27. The method of claim 25 wherein ascertaining whether mRNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin includes amplifying the mRNA in the 
presence of primers substantially complementary to the nucleotide sequence encoding the 

15 protein. 

28. The method of claim 27 wherein mRNA of a said bovine animal not displaying muscular 
hyperplasia has a nucleotide sequence which is capable of hybridizing with a nucleic acid 
molecule having the sequence identified as SEQ ID NO:1 under stringent hybridization 
conditions. 

20 29. The method of claim 25, wherein ascertaining whether mRNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the mRNA 
in the presence of primers substantially complementary to a nucleotide sequence encoding the 
N-terminal and the C-terminal, respectively, of the protein having biological activity of myostatin. 
30. The method of claim 25, wherein ascertaining whether mRNA having a nucleotide sequence 

25 encoding a protein having biological activity of myostatin is present includes amplifying the mRNA 
in the presence of first and second primers substantially complementary to first and second 
nucleotide sequences encoding spaced apart regions of the protein, wherein said regions flank a 
mutation known to naturally occur and which when present in both alleles of a said animal results 
in said muscular hyperplasia. 

30 31. The method of claim 30 wherein an mRNA of said animal not displaying muscular 

hyperplasia contains a nucleotide sequence which hybridizes under stringent conditions with a 
nucleotide sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the 
coding sequence of DNA of a said animal displaying muscular hyperplasia is known to contain an 
1 1-base pair deletion beginning at base pair no. 821 , and said first primer is selected to be 

35 upstream of the codon encoding glutamic acid no. 275 and the second primer is selected to be 
downstream of the codon encoding aspartic acid no. 274. 

32. The method of claim 25 wherein ascertaining whether mRNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the mRNA 
in the presence of a primer containing a nucleotide sequence complementary to at least a 
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portion of a mutation known to naturally occur in a said animal and which when present in both 
alleles of a said animal results in said muscular hyperplasia. 

33. The method of claim 32 wherein an mRNA of said animal not displaying muscular 
hyperplasia contains a nucleotide sequence which hybridizes under stringent conditions with a 

5 nucleotide sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the 
coding sequence of DNA of a said animal displaying muscular hyperplasia is known to contain an 
1 1 -base pair deletion beginning at base pair no. 821 1 and said primer is selected to span the 
deleted portion. 

34. The method of claim 31 wherein the animal is of a breed selected from Belgian Blue, 
10 Astu nana, Parthenaise and Rubia Gallega. 

35. A method for determining the presence of muscular hyperplasia in a mammal, the method 
comprising: 

obtaining a sample of material containing DNA from the mammal; and 
ascertaining whether a sequence of the DNA encoding (a) a protein having biological activity 
15 of myostatin, is present, and whether a sequence of the DNA encoding (b) an allelic 

protein lacking the activity of (a), is present; 
wherein the absence of (a) and the presence of (b) indicates the presence of muscular 

hyperplasia in the mammal. 

36. The method of claim 35 wherein (b) contains a naturally occurring mutation responsible for 
20 the lack of activity. 

37. The method of claim 35 wherein the mammal is a human. 

38. The method of claim 37 wherein ascertaining whether a sequence of the DNA encoding (a) 
is present, and whether a sequence of the DNA encoding (b) is present includes amplifying the 
DNA 

25 in the presence of primers based on a nucleotide sequence encoding a protein having biological 
activity of myostatin. 

39. The method of claim 38 wherein said primers are based on the sequence identified as SEQ 
ID NO:7. 

40. A method for determining the presence of muscular hyperplasia in a mammal, the method 
30 comprising: 

obtaining a sample of material containing mRNA from the mammal; and 

ascertaining whether a sequence of the mRNA encoding (a) a protein having biological 

activity of myostatin, is present, and whether a sequence of the mRNA encoding 

(b) a protein at least partially encoded by a truncated nucleotide sequence 
35 corresponding to substantially the sequence of the mRNA and lacking the 

activity of (a), is present; 
wherein the absence of (a) and the presence of (b) indicates the presence of muscular 

hyperplasia in the mammal. 
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41 . The method of claim 40 wherein the mRNA encoding (a) and the truncated sequence 
correspond to alleles of DNA of the mammal. 

42. The method of claim 40 wherein the mammal is human. 

43. The method of claim 42 wherein ascertaining whether a sequence of the mRNA encoding 

5 (a) is present, and whether a sequence of the mRNA encoding (b) is present includes amplifying 
the mRNA in the presence of a pair of primers complementary to a nucleotide sequence 
encoding a protein having biological activity of myostatin. 

44. The method of claim 43 wherein each said primer contains a truncated nucleotide sequence 
substantially complementary to a portion of the sequence identified as SEQ ID NO:7. 

1 0 45. The method of claim 44 wherein the truncated sequence contains at least 50 consecutive 
nucleotides substantially corresponding to about 10. or between about 10 and 20. or between 
about 20 and 30. or between about 30 and 40. or between about 40 and 50 consecutive 
nucleotides of SEQ ID NO:7. 

46. A method for determining the presence of muscular hyperplasia in a mammal, the method 
15 comprising: 

obtaining a tissue sample of containing mRNA of the mammal; and 
ascertaining whether an mRNA encoding a mutant type myostatin protein lacking 
biological activity of myostatin is present, 
wherein the presence of a said mRNA encoding a mutant type myostatin protein indicates the 
20 presence of muscular hyperplasia in the mammal. 

47. The method of claim 46 wherein the mutant type myostatin protein lacing biological activity is 
encoded by a naturally occurring allele of DNA encoding the mRNA. 

48. A method for determining the presence of double muscling in a bovine animal, the method 
comprising: 

25 obtaining a sample of material containing DNA from the animal; and 

ascertaining whether the DNA contains the nucleotide coding sequence identified as 
SEQ ID NO:11, 

wherein absence of the sequence indicates double muscling in the animal. 

49. The method of claim 34 wherein the animal is of a breed selected from Belgian Blue, 
30 Asturiana, Parthenaise and Rubia Gallega. 

50. A method for determining the myostatin genotype of a mammal, comprising: 

obtaining a sample of material containing nucleic acid of the mammal, wherein the nucleic 

acid is uncontaminated by heterologous nucleic acid; 
ascertaining whether the sample contains a (i) nucleic acid molecule encoding a protein 
35 having biological activity of myostatin; and 

ascertaining whether the sample contains an (ii) allelic nucleic acid molecule encoding a 
protein lacking biological activity of myostatin. 

51. The method of claim 50 wherein the mammal is human and (i) comprises a nucleic acid 
sequence substantially homologous with the sequence identified as SEQ ID NO:7. 



BNSDOCID: <WO 9902667A1_L> 



WO 99/02667 



-43- 



PCT7IB98/01197 



52. A purified protein having biological activity of myostatin, and having an amino acid sequence 
identified as SEQ ID IMO:2, or a conservatively substituted variant thereof. 

53. An isolated nucleic acid molecule encoding a protein of claim 52. 

54. An isolated nucleic acid molecule comprising a DNA molecule having the nucleotide 

5 sequence identified as SEQ ID NO:1 or which varies from the sequence due to the degeneracy 
of the genetic code, or a nucleic acid strand capable of hybridizing with at least one said nucleic 
acid molecule under stringent hybridization conditions. 

55. Isolated mRNA transcribed from DNA having a sequence which corresponds to a nucleic 
acid molecule according to claim 54. 

10 55. Isolated DNA having a sequence according to claim 54 in a recombinant cloning vector. 

57. A microbial cell containing and expressing heterologous DNA which is complementary a 
nucleic acid molecule of claim 54. 

58. A transfected cell line which expresses a protein of claim 52. 

59. A process for producing the protein of claim 52 comprising: 

15 preparing a DNA fragment including a nucleotide sequence which encodes said protein; 

incorporating the DNA fragment into an expression vector to obtain a recombinant DNA 
molecule which includes the DNA fragment and is capable of undergoing 
replication; 

transforming a host cell with said recombinant DNA molecule to produce a transformant 
20 which can express said protein; 

culturing the transformant to produce said protein; and 
recovering said protein from resulting cultured mixture. 

60. A method of increasing muscle mass in a mammal, comprising administering an effective 
amount of an antibody to myostatin to said mammal. 

25 61. A method of increasing muscle mass in a mammal, comprising raising an autoantibody to 
the myostatin the in the mammal. 

62. The method of claim 61 wherein raising the autoantibody includes administering a protein 
having myostatin activity to the mammal. 

63. A method of increasing muscle mass in a mammal in need thereof, comprising 

30 administering to the mammal an effective amount of an antisense nucleic acid or oligonucleotide 
substantially complementary to at least a portion of the sequence identified as SEQ ID NO:1 or 
SEQ ID NO:5, or SEQ ID NO:7. 

64. The method of claim 63 wherein the portion is at least 5 nucleotide bases in length. 

65. The method of claim 64 wherein the mammal is a bovine and the sequence is the sequence 
35 identified as SEQ ID NO:1 . 

66. A method of increasing muscle mass in a mammal, comprising administering to the 
mammal an effective amount of an antibody to the myostatin. 

67. A probe comprising a nucleic acid molecule sufficiently complementary with a sequence 
identified as SEQ ID NO:1 , or its complement, so as to bind thereto under stringent conditions. 
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68. The probe of claim 67 wherein the sequence is between about 8 and about 1195 
nucleotides in length, or between about 1 5 and 1 1 95 nucleotides in length, or between about 25 
and 1 195 nucleotides in length, or between about 35 and 1 195 nucleotides in length, or between 
about 45 and 1195 nucleotides in length, or between about 55 and 1 195 nucleotides in length, or 
5 between about 65 and 1 195 nucleotides in length, or between about 75 and 1 195 nucleotides in 
length, or between about 75 and 1 195 nucleotides in length, or between about 85 and 1 195 
nucleotides in length, or between about 95 and 1 195 nucleotides in length, or between about 105 
and 1195 nucleotides in length, or between about 1 15 and 1195 nucleotides in length. 
69. A method for identifying a nucleotide sequence of a mutant gene encoding a myostatin 
1 0 protein of a mammal displaying muscular hyperplasia, the method comprising: 
obtaining a sample of material containing DNA from the mammal; and 
probing the sample using a nucleic acid probe based on a nucleotide sequence of a 

known gene encoding myostatin in order to identify nucleotide sequence of the 
mutant gene. 

15 70. The method of claim 69, wherein the probe is based on a nucleotide sequence of a non- 
coding region of the gene. 

71 . The method of claim 70 wherein the probe is based on SEQ ID NO:54. 

72. The method of claim 71 wherein the probe is at least 8 nucleic acids in length. 

73. The method of claim 69, wherein the step of probing the sample includes exposing the DNA 
20 to the probe under hybridizing conditions and further comprising isolating hybridized nucleic acid 

molecules. 

74. The method of claim 73, further comprising the step of sequencing isolated DNA. 

75. The method of claim 69. wherein the mammal is a bovine mammal and the probe is based 
on a said nucleotide sequence identified as SEQ ID NO:1. 

25 76. The method of claim 74. further comprising the step of isolating and sequencing a cDNA or 
mRNA encoding the complete mutant myostatin protein. 

77. The method of claim 71 . further comprising the step of isolating and sequencing a functional 
wild type myostatin from a said mammal not displaying muscular hyperplasia. 

78. The method of claim 76, further comprising comparing the complete coding sequence of the 
30 complete mutant myostatin protein with, if the coding sequence for a functional wild type 

myostatin from a said mammal is previously known, (1) the known sequence, or if the coding 
sequence for a functional wild type myostatin from a said mammal is previously unknown, (2) the 
sequence determined according to claim 74 or claim 77, to determine the location of any 
mutation in the mutant gene. 
35 79. A method for determining the myostatin genotype of a mammal, wherein wild type myostatin 
of the mammal is substantially that of claim 78, comprising: 

obtaining a sample of material containing DNA from the mammal; and 
ascertaining whether the DNA contains a said mutation determined according to claim 
78. 
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80. A method for determining the myostatin genotype of a mammal, wherein wild type myostatin 
of the mammal is substantially that of claim 78, comprising: 

obtaining a sample of material containing mRNA from the mammal; and 
ascertaining whether the mRNA contains a said mutation determined according to claim 
5 78. 

81 . A primer composition useful for the detection of a nucleotide sequence encoding a myostatin 
comprising a first nucleic acid molecule based on a nucleotide sequence located upstream of a 
said mutation determined according to claim 78 and a second nucleic acid molecule based on a 
nucleotide sequence located downstream of the mutation. 

10 82. A probe comprising a nucleic acid molecule based on a nucleotide sequence of claim 74 or 
claim 76 and spanning a said mutation determined according to claim 78. 
83. A transgenic mammal having a phenotype characterized by muscular hyperplasia, said 
phenotype being conferred by a transgene contained in the somatic and germ cells of the 
mammal, the transgene encoding a myostatin protein having a dominant negative mutation. 

15 84. The transgenic mammal of claim 83 wherein the mammal is male and non-human and the 
transgene is located on the Y chromosome. 

85. The transgenic mammal of claim 83 wherein the mammal is bovine and the transgene is 
located to be under the control of a promoter which normally a promoter of a myosin gene. 

86. A transgenic mammal having a phenotype characterized by muscular hyperplasia, said 
20 phenotype being conferred by a transgene having a sequence antisense to that encoding a 

myostatin protein of the mammal. 

87. The transgenic mammal of claim 86 wherein the mammal is bovine and the transgene is 
located on the Y chromosome. 

88. The transgenic mammal of claim 86 wherein the transgene further comprises a sequence 
25 which when transcribed obtains an mRNA having ribozyme activity. 

89. A transgenic non-human mammal having a phenotype characterized by muscular 
hyperplasia, said phenotype being inducible and being conferred by a myostatin gene flanked by 
J oxP sides and a Cre transgene under the dependence of an inducible promoter. 

90. A transgenic non-human male mammal having a phenotype characterized by muscular 
30 hyperplasia , said phenotype being conferred by a myostatin gene flanked by J oxP sides and a 

Cre transgene located on the Y chromosome. 

91 . A method for determining whether a sample of mammalian genetic material is capable of a 
conferring a phenotype characterized by muscular hyperplasia, comprising ascertaining whether 
the genetic material contains a nucleotide sequence encoding a protein having biological activity 

35 of myostatin, wherein the absence of said sequence indicates the presence of muscular 
hyperplasia in the animal. 

92. A transgenic bovine having a genome lacking a gene encoding a protein having biological 
activity of myostatin. 
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93. A transgenic mouse having a genome containing a gene encoding a human protein having 
biological activity of myostatin or containing a gene encoding a bovine protein having biological 
activity of myostatin. 

94. A transgenic bovine having a gene encoding a bovine protein having biological activity of 
5 myostatin and heterologous nucleotide sequence antisense to the gene. 

95. A transgenic bovine of claim 94, further comprising a gene encoding a nucleic acid 
sequence having ribozyme activity and in transcriptional association with the nucleotide sequence 
antisense to the gene. 
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Fig. 4 
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SEQUENCE ID NO. 1 



PCT/IB98/01197 



1 AGGAAGAATA AGAACAAGGG AAAAGATTGT ATTGATTTTA AAACCATGCA 
51 AAAACTGCAA ATCTCTGTTT ATATTTACCT ATCTATGCTG ATTGTTGCTO 
101 GCCCAGTGGA TCTGAATGAG AACAGCGAGC AGAAGGAAAA TGTGCAAAAA 
151 GAGGGGCTCT GTAATGCATG TTTGTGGAGG GAAAACACTA CATCCTCAAG 
201 ACTAGAAGCC ATAAAAATCC AAATCCTCAG TAAAC1TCGC CTGGAAACAG 
2 51 CTCCTAACAT CAGCAAAGAT GCTATCAGAC AAC'ITTTGCC CAAGGCTCCT 
301 CCACTCCTGG AACTGATTGA TCAGTTCGAT GTCCAGAGAG ATCCCAGCAG 
351 TGACGGCTCC TTGGAAGACG ATGACTACCA CGCCACGACG GAAACCGTCA 
401 TTACCATGCC CACGGAGTCT GATCTTCTAA CGCAAGTGGA AGCAAAACCC 

451 AAATGTTGCT TCTTTAAATT TAGCTCTAAG ATACAATACA ATAAACTAGT 
501 AAAGGCCCAA CTGTGGATAT ATCTGAGGCC TGTCAAGACT CCTGCGACAG 

551 TGTTTGTGCA AATCCTGACA CTCATCAAAC CCA'I'GAAAGA CGGTACAAGG 

601 TATACTGGAA TCCGATCTCT GAAACTTGAC ATGAACCCAG GCACTGGTAT 

651 TTCCCAGAGC ATTGATGTGA AGACACTGTT GCAGAACTGG CTCAAACAAC 

701 CTGAATCCAA CTTAGGCATT GAAATCAAAG OriTAGATCA GAATGGCCAT 

751 GATCTTGCTG TAACCTTCCC AGAACCAGGA GAAGATGGAC TGACTCCTTT 

801 TTTAGAAGTC AAGGTAACAG ACACACCAAA AAGATCTAGG AGAGA1TTTG 

851 GGCTTGATTG TGATGAACAC TCCAOAGAAT CTCGATGCTG TCUTTACCCT 

901 CTAACTGT GG ATTTTGAAGC TTTTGGATGG GATTGGATTA TTGCACC TAA 

951 AAGATATAAG GCCAATTACT GCTCTGGAGA ATGTGAATTT GTATTTTTGC 

1001 AAAAGTATCC TCATACCCAT CTTGTGCACC AAGCAAACCC CAGACCTTCA 

1051 CCCGGCCCCT GCTGTACTCC TACAAAGATG TCTCC AATTA ATATGCi'ATA 

1101 TTTTAATGGC CAAGGACAAA TAATATACGG GAAGATTCCA GCCA'I'CCJTAG 

1151 TAGATCGCTG TGGGTGTTCA TGAGTCTATA TTtGGgTTCA 'J'AAGC 
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SEQUENCE ID NO. 2 



1 fMQKLQISVY IYLPMLIVAQ PVDLNENSEQ KENVEKEC3LC NAC I.. WRENTT 

51 SSKLEATKIQ ILSKI.RLETA PNISKnAIRQ LLPKAPFLLE LIDQFDVQRD 

101 ASSDGSLEDD DYIIARTETVI TMPTESDLLT gVEGKPKCCP FK7SSKIQYN 

151 KT.VKAQLWIY L.R FVKTPATV FVQILRr.TKP MKIXJTFYTGI KSLKLDMNPG 

201 TGTWQSIDVK TVDQNWLKQP ESNLGIETKA LDliNGHDTjAV TFPEPGEDGL 

251 TPFLEVKVTD TPKRSRRDFC LDCDEHSTES RCCKYPI/TVn FEAFCWDWTI 

3 01 APKRVKANYC SGECEFVFLQ KYPHTKLVHQ ANPRGSAGPC CTPTKMSPIN 

351 KOYFNGECQI TYGKIPAMW DRCGCS* Wl I HHW 
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SEQUENCE ID NO. 3 

1 AGGAAGAATA AGAACAAGGG AAAAGATTGT ATTGATH "IT A AAACCATGCA 

51 AAAACTCCAA ATCTCTGTTT ATATTTACCT ATTTATGCl U ATTGTTGCTG 

101 GCCCAGTCCA TCTGAATGAG AACAGCGAGC AGAAGGAAAA ' I fJTGGAAAAA 

15? CAGGGGCTGT GTAATGCATG TTTGTGGAGG GAAAACACTA CATCCTCAAG 

201 ACTAGAAGCC ATAAAAATCC AAATCCTCAG TAAACTTCGC CTGGAAACAG 

251 CTCCTAACAT CAGCAAAGAT GCTATCAGAC AACTTTTUCJC CAAGGCTCC1' 

301 CCACTCCTGG AACTGATTGA TCAGTTCGAT GTCCAGAGAC ATGCCAGCAG 

351 TGACGGCTCC TTGCAAGACG ATGACTACCA CGCCAGGAf!G GAAACGGTCA 

£01 TTACCATGCC CACGGAGTCT GATCTTCTAA CGCAAGTGGA AGCAAAACCC 

451 AAATGTTGCT TCTTTAAATT TAGCTCTAAG ATACAATACA ATAAACTAGT 

501 AAAGGCCCAA CTGTGGATAT ATCTGAGGCC TGTCAAGACT CCTGCGACAG 

551 TGTTTGTGCA AATCCTCACA CTCATCAAAC CCATGAAAGA CGGTACAAGO 

601 TATACTGGAA TCCGATCTCT GAAACTTGAC ATGAACCCAG GCACTGGTAT 

651 TTGGCAGAGC ATTGATGTGA AGACAGTGTT GCAGAACTGG CTCAAACAAC 

701 CTGAATCCAA CTTACGCATT GAAATCAAAG CTTTAGATGA GAATGGCCAT 

751 GATCTTGCTG TAACCTTCCC AGAACCAGGA GAAGATGGAC TGACTCCTTT 

801 TTTAGAAGTC AAGGTAACAG AC AC AC C AAA AAGATCTAGG AGAGATTTTG 

851 GGCTTGATTG TGACAGAATC TCGATGCTGT CGTTACCCTC TAACTGTGGA 

901 TTTTGAAGCT TTTGGATGGG ATTGGATTAT TCCACCTAAA AGATATAAGG 

951 CCAATTACTG CTCTGGAGAA TGTGAATTTG TATTTTTGCA AAAGTATCCT 

1001 CATACCCATC TTGTGCACCA AGCAAACCCC AGAGGTTCAG CCGGCCCCTC 

1051 CTGTACTCCT ACAAAGATGT CTCCAATTAA TATGCTATAT TTTAATGGCG 

1101 AAGGACAAAT AATATACGGG AACATTCCAG CC ATGGTAGT AAATCGCTGT 

1151 GGGTGTTCAT GAGGTCTATA TTTGGTTCAT AGCTTCCTCA AACATGGAAG 

1201 GTCTTCCCCT CAACAATTTT GAAACTGTTG AAATTATGT 
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SEQUENCE ID NO. 4 

1 JmQKLQISVY I YLFML IVAG PVDLNENSEQ KENVEKEGLC NACLWK KNTT 

51 SSRLEAIKIQ ILSKLRLETA PNISKDAIRQ I.LPKAPPLLB LlbyFDVQRD 

101 ASSDGSLEDD DYHARTETVI TMPTESDLLT ©VEGKPKCCF PK KSb'KIQYN 

151 KLVKAQLWIY LR PVKTPATV FVQILRLXKP MKDGTRYTGI RST.K l.UMUPG 

201 TGIWQSIDVK TVLQNWLKQP ESNFrfSTEIKA LDENGHDI/AV TFPEPGttDGl, 

251 TPFLEVKVTD TPKR.SRRDFG TjDCDRISMLS LPSNCGF 
301 
351 
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SEQUENCE ID NO. 5 



PCT/IB98/01197 



1 


GTCTCTCGGA CGGTXCATGC ACTAATAXTT CACTTGGCAT TACTCAAAAG 


SI 


CAAAAAGAAG 


AAATAAGAAC 


AAGCGAAAAA 


AAAAGATTGT 


GCTGATTTTT 


101 


AAAATGATGC 


AAAAACTGCA 


AATGTATGTT 


TATATTTACC 


TCTTCAl*GCT 


151 


GATTGCTGCT 


GGCCCAGTGG 


ATCTAAATGA 


GGGCAGTGAG AGAGAAGAAA 


201 


ATGTGGAAAA 


AGAGGGGCTC 


TGTAATGCAT 


GTGCGTGGAG 


ACAAAACACG 


251 


AGGTACTCCA 


GAATAGAACC 


CATAAAAATT 


CAAATCCTCA 


GTAAGCTGCG 


301 


CCTGGAAACA 


GOTCCTAACA 


TCAGCAAAGA 


TGCTATAAGA 


CAACTTCTGC 


351 


CAAGACCGCC 


TCCACTCCCG 


GAACTGATCG 


ATCAGTACGA 


CGTCCAGAGG 


401 


GATGACAGCA 


GTGATGGCTC 


TTTGGAAGAT 


GACGATTATC 


ACGCTACCAC 


451 


GGAAACAATC 


ATTACCATGC 


CTACAGAGTC 


TGACTTTCTA 


ATGCAAGCGG 


501 


ATGGCAAGCC 


CAAATGTTGC 


TTTTTTAAAT 


TTAGCTCTAA 


AATACAGTAC 


551 


AACAAAGTAG 


TAAAAGCCCA 


ACTGTGGATA 


TATCTCAGAC 


CCGTCAAGAC 


601 


TCCTACAACA 


GTGTTTCTGC 


AAATCCTGAG 


ACTCATCAAA 


CCCATGAAAG 


651 


ACCCTACAAG 


GTATACTGGA 


ATCCGATCTC 


TOAAACTTGA 


CATGAGCCCA 


701 


GGCACTGGTA 


TTTGGCAGAG 


TATTGATGTG 


AAGACAGTGT 


TGCAAAATTG 


751 


GCTCAAACAG 


CCTGAATCCA 


ACTTAGGCAT 


TGAAATCAAA 


GCTTTGGATG 


801 


AGAATGGCCA 


TGATCTTCCT 


GTAACCTTCC 


CAGGACCAGG 


AGAAGATGGG 


851 


CTGAATCCCT 


TTTTAGAACT 


CAAGGTGACA 


GACACACCCA 


AGAGGTCCCG 


901 


GAG AG AC TTT 


GGGCTTGACT 


GCGATGAGCA 


CTCCACGGAA 


TCCCGGTGCT 


951 


GCCGCTACCC 


CCTCACGGTC 


GATT1TGAAG 


CCTTTGGATG 


GGACTGGATT 


1001 


ATCGCACCCA 


AAAGATATAA 


GGCCAATTAC 


TGCTCAGGAG 


AGTGTCiAATT 


1051 


TGTGTTTTTA 


CAAAAATATC 


COCATACTCA 


TCTTGTGCAC 


CAAGCAAACC 


1101 


CCAGAGGCTC 


AGCAGGCCCT 


TGCTGCACTC 


CGACAAAAAT 


GTCTCCCATT 


1151 


/ -aTATGCTAT 


ATTTTAATGG 


CAAAGAACAA 


ATAATATATG 


GCJAAAATTCC 


1201 


AGCCATGGTA 


GTAGACCGCT 


CTCG GTGCTC 


ATGAGCTTTG 


CATTAGGTTA 


1251 


GAAACTTCCC 


AAGTCATGGA 


AGGTCTTCCC 


CTCAATTTCG 


AAACTGTGAA 
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1301 TTCAAOCACC ACAGGCTGTA GGCCTTGAGT ATGCTCTACT AACGTAAGCA 

1351 CAAGCTACAG TGTATGAACT AAAAGAOAGA ATAGATGCAA TGGTTGGCAT 

1401 TCAACCACCA AAATAAACCA TACTATAGGA TGTTGTATGA TTTCCAGAGT 

1451 TTTTGAAATA GATGGAGATC AAATTACATT TATC3TCCATA TATOTATATT 
1501 ACAACTACAA TCTAGCCAAG GAAGTGAGAG CACATCTTGT GGTCTGCTUA 
1551 GTTAGGAGGG TATGATTAAA AGGTAAAGTC TTATTTCCTA ACAGTTTCAC 

1601 TTAATATTTA CAGAACAATC TATATGTAGC CTTTGTAAAG TGTAGGATTG 

1651 TTATCATTTA AAAACATCAT GTACACTTAT ATTTGTATTG T ATACTTG GT 

1701 AAGATAAAAT TCCACAAAGT AGGAATGGGG CCTCAOATAC ACATTGCCAT 

1751 TCCTATTATA ATTGGACAAT CCACCACGGT GCTAATGCAG TGCTCAATGG 

IB 01 CTCCTACTGG ACCTCTCGAT AGAACACTCT ACAAAGTACG AGTCTCTCTC 

1851 TCCCTTCCAG GTGCATCTCC ACACACACAG CACTAAGTGT TCAATGCATT 

1901 TTCTTTAAGG AAAGAAGAAT CTTTTTTTCT AGAGGTCAAC TTTCAGTCAA 

1951 CTCTAGCACA GCGGGAGTGA CTGCTGCATC TTAAAAGGCA GCCAAACAGT 

2001 ATTCATTTTT TAATCTAAAT TTCAAAATCA CTGTCTGCCT TTATCACATG 

7051 CCAATTTTGT GGTAAAATAA TGGAAATGAC TGGTTCTATC AATATTGTAT 

2101 AAAAGACTCT CAAACAATTA CATTTATATA ATATGTATAC AATATTGTTT 

2151 TGTAAATAAG TGTCTCCTTT TATATTTACT TTGGTATATT TTTACACTAA 

22 01 TGAAATTTCA AATCATTAAA GTACAAAGAC ATGTCATGTA TCACAAAAAA 

2251 GGTGACTGCT TCTATTTCAG AGTGAATTAG CAGATTCAAT AGTGGTCTTA 

2301 AAACTCTGTA TGTTAAGATT AGAAGGTTAT ATTACAATCA ATTTATGTAT 

2351 TTTTTACATT ATCAACTTAT GGTTTCATGG TGGCTGTATC TATGAATGTG 

2401 GCTCCCAGTC AAATTTCAAT GCCCCACCAT TTTAAAAATT A CAAGC ATT A 

2451 CTAAACATAC CAACATGTAT CTAAAGAAAT ACAAATATGG TATCTCAATA 

2501 ACAGCTACTT TTTTATTTTA T AATTTG AC A ATGAATACAT TTCTTTTATT 

2551 TACTTCAGTT TTATAAATTG GAACTTTGTT TATCAAATGT ATTGTACTCA 

2601 TAGCTAAATG AAATTATTTC TTACATAAAA ATGTGTAGAA ACTATAAATT 

2651 AAAGTGTTTT CACATTTTTG AAACCC 
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SEQUENCE ID NO. 6 



1 


MMQKLQWYVY 


IYLiFMLIAAG 


PVDLNEGSER 


EETTVEKEGDC 


NACAWRQNTR 


51 


YSRIEAIKIQ 


IL.SKLRL.ETA 


PNISKDAIRQ 


LLPRAPPLRE 


LIDQYDVQRD 


101 


DSSnGST.EDD 


DYHATTETII 


TMPTESDFLM 


OADGKPKCCF 


FKFSSKIQYN 


151 


KWXAQLWIY 


LRPVKTPTTV 


rVQIliRIiXKP 


MKDGTRYTGT 


RST.KT.DMSPG 


201 


TGTWQSIDVK 


TVLQNWLKQP 


ESNLGIEIKA 


LiDENGHDLAV 


TFPGPGEDGL 


251 


NPFLEVKVTD 


TPKRSRRDFG 


LDCDEHSTES 


RCCRYPLTVD 


FEAFGWDWII 


301 


APKRYKANYC 


SGECh.1* VI- LQ 


KYPHTHLVHQ 


ANPRGSAGPC 


CTPTKMSPIN 


351 


MLYFNGKEOI 


IYGKIPAMW 


DRCGCS* 
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SEQUENCE ID NO. 7 



PCT/IB98/01197 



ha3753 NGATTTTCTAATrtCAAGTOGATCCAAAACCC 



he 3 7 5 3 AAATCTTCCTTCTTTAAATTTAOCTCTAAAATACAATACAATAAACTACTAAAGGCCCAA 



b«37S3 CTATCCATATATTTGAGACCCGTCGAQACTCCTACAACAOTCJTTTCT 



hc37 5 3 CTCATCAAACCTATGAJLA<LkC(>OTACAACk#rATACTGGAATCCGATCTCTGAAACTrQAC 
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h«3753 ATCWU^CACQC^CTCOTATCTtt^^ 
in3753 CTTCAAACJU^CTOAAXCC/LACT^^ 

b*3753 CWVTCTTaCTtXAACCXTCCCACCAC^CX^^ 

h«37 5 3 CCT^CXA G<^AAC/WGACACAfyiX XAAAQA.TTCCAOAAC?CGAT rTT C GGTfrTT(jACTW*TGA 



h«37 5 3 TGAGCACTCAACAOAATCACC^TCCTCTCCT^ 
ho 37 5 3 TXCCGATCOGXTTaOA - TATCa 



^ 0 762 3 AOccaATaaTAcaTAaACcacTOTOooTacrrc! 



hs7 « 2 3 ATOAGATTTATATTAAGCCTTCATAACTrCCTAAAACATGGAAMTTTTCCCCTCAACAA 
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h*7823 TTTTCAAaCTCWAAATTAAaTACCA 



h«7823 AAO<^TAAaCTACACTATCTAAACTAAAAGG«^ 



Kft?Q23 TTTAACCATCCAAACAAATCATAC - * CAGAAAGTTTTATGATTTCCAJ^GTTTTTTNAOG 



h3782^ CNAOAAAOOAOGAOTCAAAWTTTCANTCTTAXaaT 



1152027 ATTTCG GCACAGOTNAAACACTT GAA TTTATATTGTATGGTA GTATA 



h9 2 02 7 CTTGGT AAGATAAAATTCCAC AAAAAT A OO OATG QTG GA OGAT AT Q CA - ATTTCCATTCG 



h9 2027 TATTATAATTGAC^CAOTACATTAACAATCCATTCCAACCGTCCTAATACGATAOGCTGA 
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n9 5 3 2 7 TJUUtTCTC*ACOWCCATTAT*WAATA 



n9 5327 ATTOACTCTATTATC -TO- AAATOAAO - AA7AAACTOATOCXATCTCAACAA7AACTGTT 



nS 5 3 2 7 A LTririri Jl Tl w lU rATAATTTIXlA 



n$ 5 32 7 AATTOCCATTTTGTTAATCAAATTTATTCTACT - ATGACTAAATGAAATTATTTCTTACA 



nS 5 327 T - CTAA TTT OTAOAAACACTrATAACITTAT ATTAAAGTOTTTTCACATTTTTTTaAAAaAC 
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SEQUENCE ED NO. 8 



1 


MQKLQLCVYI 


YLFMLXVAGP 


VDLNENSEQK 


ENVEKEGLCN ACTWRQNTKS 


51 


SRIEAIKIQI 


LSKLRLETAP 


NISKDVIRQL 


LPKAPPLREL 


IDQYDVQRDD 


101 


SSDGSLEDDD 


YHATTETI IT 


MPTESDFLMQ 


VDGKPKCCFF 


KFSSKIQYNK 


151 


WKAQLWIYL 


RPVETPTTVF 


VQILRLIKPM 


KDGTRYTGIR 


SLKLDMNPGT 


201 


GIWQSIDVKT 


VLQNWLKQPE 


SNLGIEIKAL 


DENGHDLAVT 


FPGPGEDGLN 


251 


PFLEVKVTDT 


PRRSRRDFGL 


DCDEHSTESR 


CCRYPLTVDF 


EAFGWDWIIA 


301 


PKRYKANYCS 


GECEPVFLQK 


YPHTHLVHQA 


NPRGSAGPCC 


TPTKMSPINM 


351 


LYFNGKEQII 


YGKIPAMWD 


RCGCS* 
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SEQUENCE ID NO. 54 

1 GCGGCCGCCC GGGCAGGTAT CGAAAGTTTC ACATATAAAG AT GAAT AAGA 

51 TCTAAGTGTA TATGTTATTG TTAATAAAGT TTTTAATTTT TCGAATGTCA 

101 CATACAGCCT TTATTATTCA TAGATTTATT CCTTTTAAGA AGTAGTCAAA 

151 TGAATCAGCT CACCCTTGAC TGTAACAAAA TACTGTTTGG TGACTTGTGA 

2 01 CAGACAGGGT TTTAACCTCT GACAGCGAGA TTCATTGTGG AGCAAGAGCC 

2 51 AATCACAGAT CCCGACGACA CTTGTCTCATCAAAGTTGGA ATATAAAAAG 

301 CCACTTGGAA T AC AG T AT AAAAG&T T C AC T GGTGTGGCAA GTTGTCTCTA 

351 GACTGGGCAG GCATTAACGT TTGGCTTGGC GTTACTCAAA AGCAAAAGAA 

4 01 AAGTAAAAGG AAGAAGTAAG AACAAGGGAA AAGATTGTAT TGATTTTAAA 

4 51 AC C ATG.C AAA AACTGCAAAT CTCTGTTTAT ATTTACCTAT TTATGCTGAT 

501 TGTTGCTGGC CCAGTGGATC T GAAT GAG AA. CAGCGAGCAG AAGGAAAATG 

551 TGGAAAAAGA GGGGCTGTGT AATGCATGTT TGTGGAGGGA AAACACTACA 

601 TCCTCAAGAC TAGAAGCCAT AAAAATCCAA ATCCTCAGTA AACTTCGCCT 

651 GGAAACAGCT C C TAACAT C A GCAAAGATGC TATCAGACAA CTTTTGCCCA 

7 01 AGGCTCCTCC ACTCCTGGAA C T GAT T GAT C AGTTCGATGT C C AG AG AG AT 

7 51 GCCAGCAGTG ACGGCTCCTT GGAAGACGAT G AC T AC C AC G CCAGGACGGA 

8 01 AACGGTCATT ACCATGCCCA CGGAGT/GTGA GTAGTCCTGCTGGTGCAAAG 
851 CAACGACTCT GCTGACTGCT GTTCTAGTGT TCATGAAAAA CCGATCTATT 
901 TTCAGGCTCT TTTAACAAGC TGCTGGCTTG TATGTAAGGA GGAGGGGAAA 
951 GAGCTTTTTT CAAGATTTCA TGAGAAATAG ACCAATGAGA CTGAAAGCTG 

1001 CTACTTTATT TGTTTCCTTA GAGAGCTAAA AAGCTAAAAA TCAAAAATGA 

1051 AATGCTTGCA TAGCATTCAT GTTATATAGT T TAG TAT G AC AACTATAACA 

1101 TGTTTATGTT TTCACAGCTT AATGCTACCA AGGTAAAGGA TTGGGAAACA 

1151 GTATCAGCAA TGTGAAAAAT TTACATCAAA TTTCCTAATT GCATTTGGTT 

BNSOOCIO. <WO 9902667A1_I_> 
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1201 GCCTGAAATA TGCATTTATA 

1251 AGAGAAAGGA AGAAATCTGT 

1301 AACACTTAGA ATGACTTCTG 

1351 ATGGTCTTCA CAGAG TATCT 

1401 AAAAGAATAT TCTTAATAAA 

1451 TAAACACAGC AT AG T G AAAA 

1501 AAATAAACAT TTTAATTACA 

1551 ATTGTTGAGA GTACCTTGTC 

1601 ACCTTCTAAA ATATTATTGT 

1651 TAT G TAG T AC CTATGTTGTT 

1701 GTTGCAGTCA CTTCAAACCT 

1751 ACAGAGAAGG CATGACCAGA 

1801 GCTTTATACA GGGCTCTACC 

18 51 AATACTGCTT TTTCTTATTA 

1901 TTCCTTTAAG ACTGTGCTAT 

1951 TATAAACAAT C T T GAG AAAA 

2001 GCTTACAATG ACAGCCTGGC 

2 051 CAGCTTGAAT ACAACATCTA 

2101 TTATTTTTTT CCTTTAAAAG 

2151 ACTATATTTT CTGCTAATTC 

2201 TGTCCCCAGG TAATTCAGGC 

22 51 TTGGTACAGC TGCTCAGTAA 

2 301 AAGTGGATGT TCTTCCACAG 

2351 TTAAAATTTC ATCCACTTTT 

2 4 01 AGTTCTCTGG AAAGGAAGTA 

2 4 51 CCTAAAAGAT TCTGAAAAGC 
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AT AAC AG G T T TTTTTTTTTT CATTAATAAA 
AGAGGTTGAA GCCTATCTGG GCATTTGCTG 
T TATT C AAAA CTATTTCTCA TAGGGTTTTT 
AATTTTGAAA GCTATTAGAG TGGAAAGGAT 
CTTAATGTAT TAGTAAGAGC AATAAGGAAG 
AT CAT GAG CT AATCAGCAGA AAATTCTAAG 
AAGTTCCACT TATACCCTGA CCATGGTACT 
TG CACAT AT C TAGGAGGCAC ATGCTTAATA 
ATTCCTCATA GGAGGGAGAA CTATTACCTA 
TCTGAAAGAT AATATGTTTC ATGTATTTCT 
ATACTCAAGG AAAGGGAGAC AGGCATCTCA 
AAGAGTTTTG TGCCATGTGT CTGCGATCTT 
CACTTTAAAC TGGACTCAAA ACAGTTTCAA 
AG TAAC TAG T TTATAAGGCA ACAAATAAAT 
CAGATAATCC TGGAATAGAT TTGCCTTACT 
CAAAAAGGCA AGAAATTGCT AAGTGCTTCT 
CCTAAAGACA ATGTTTTCTA AGTTTTGAAA 
AGTTTTGGTG CTAATTACCT GCTAGTTTTT 
GCTGTCCCAG CGTCCTAACA TAACAGATGC 
CCGAGGCTCA GTTAGTTGCT CACTGTGTCT 
CTGGGGGAAG GGTTCCTTCC TCCAGACTGA 
GTGTAACTAC TCAGATTCCC AAAGAATTCT 
TGTCTCTTGT TCTCTCTAAT CATCATCATT 
CATTCCTTAA TAGAATTTTC CTTAGTCCAC 
GGCTTCTCAT AhCAGCTGAA AAAACATATA 
TGTAATAACT GTTATACTTG ATATTTTGCT 
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2501 GTTATGAATG AAATGCTACA TATTTTTCCA T T TTAAAAGA CTAAATATGC 

2551 ACAC AT T ATTCCAAT TAAAA AATGTTCATA GATTGATATGGAGGTGTTCG 

2601 TTCATTTTTC ATAAAAATGA TCTTAGTAAC TTTTTTTCTT ATTCATTTAT 

2 651 AG/CTGATCTT CTAACGCAAGTGGAAGGAAA ACCCAAATGT TGCTTCTTTA 

2701 AATTTAGCTC TAAGATACAA TACAATAAAC TAG TAAAGGC CCAACTGTGG 

2751 ATATATCTGA GGCCTGTCAA GACTCCTGCG ACAGTGTTTG TGCAAATCCT 

2801 GAGACTCATC AAACCCATGA AAGACGGTAC AAGGTATACT GGAATCCGAT 

2851 CTCTGAAACT T G AC AT G AAC CCAGGCACTG GTATTTGGCA GAGCATTGAT 

2 901 GTGAAGACAG TGTTGCAGAA CTGGCTCAAA CAACCTGAAT CCAACTTAGG 

2951 CATTGAAATC AAAGCTTTAG ATGAGAATGG CCATGATCTT GCTGTAACCT 

3001 TCCCAGAACC AGGAGAAGAT GGACTG/GTAA G TGAT TAC TGAAAATAACAT 

3051 GCTAAAAACC TTGTTATGTG TTTATTCATA ATGTGAATGA ATAG TAG TGA 

3101 AAAAT AAC T A CCAGTTTCCT GTGCTTATAA G C C AGAC AAA GGCACCTTAC 

3151 CCCAGTGGTA GCCCTGTACT CAATAAAAGT AGGTGTCCCA TTTCACATCC 

3201 TAT GAAAC AC TCTCTTGATA CTTTGACTTT GCATGAGGAT TTAAAAGAAA 

32 51 AAAAGTTATA CCATGGTCCT TAAGTTTTTA GGGAATTCTT TGGAATTGAG 

33 01 AAT GAAAT AT AAAATGCTTT CCGTTGATGT G C TAC AT GAT TATATAAATA 
3351 AAAACATGAA GTCTTCACAG TGGATTCTAG TACTCACCCA AC AAC AC ATT 

34 01 TTTTCCCCCA GAAGAG TGAC CAATTTGTTA AAATTCTTTT GCTTAATAAG 
34 51 GCAGAAAAAT GAACTCTACA AG T TAT AAT T AAAAT AAAAT GCTTTTACTT 
3501 ATAGAAAT T A ACTAGATATA TGTTCAGGTT TATATACTAT T AAAT AT AC T 
3551 ATATTTAAGA TCTCTCATGA TAAATATGTT CCTTGTTTTA TAGACTATTG 
3601 ATGCACTGAT GTATATGTGG ATTACTTTGT GAATTACCCC TGGTAAAATT 
3651 AAAAATT T C A GGCTAGTTAA CTTGTACTAC TTAGCTATTT TCTGAACTGT 
3701 CTTACTGTTC T T T AAC AG G A G T T AAC T TAG GTAATGTCAA CTAATTTAAT 
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3751 ATAAAGTCAA ACAGAAAATA ATGCCTTATA TAT T AT AAAA ATTAATAAAA 
3801 AACCATTTTA AAATCTAGTA TAAGTTTAGA GCTACTCACT CTTCTGGCTT 
3851 ATCTATGCTT GTATTTACTT CTGTTTTCAA AAAATTTTTT AATG TGACCA 
3901 TACCTTTTAT TTCCAGTTAT TGATATAATT TACAACAAAA GAT TAT AC T T 
3951 GCAAGCTTTA TAGTTTTTAA ATGGTCTTAT TTGTAGTGAA TATCATATCT 
4 001 AAATGATATC TAAATGTAAA GTAAATCATA CCTAAATGAA AACATATTCT 
4 051 TTAAGTCATT AT AAAAT T T T CCAGGTGATC AATTTTTCTT TAAATATACT 

4101 ACATAAAATG TTATTGACTC CCAAAATGAT GTTATTTTGT ATAATCTTAA 

4151 ATACCAATAA TTACCAGGTC TATTTTGGTT TTAGTGTAGG ATAAAAAAGA 

42 01 ATGTGTTCTT TTTTCTAGGT AGCATTTTAA TGATCAAAGT TGGTGACGTG 

4251 ACAGAGGTCT TAAG TAT TAT TAAACAGATG ATTAATAAGA TGTATTCCTC 

4 301 AGACTTTTCC ATATAAAAGG AAAAATGTCT CAAATTCATG AAAAGATTGG 

4 3 51 TACAGGAGGA GG AT TAG C AA ATTGTAGTTT AAATATCTGA ATGGAAACAC 

4 4 01 TTTTTAGTGA AAGAATAAAG GGAATATCAT TGTATCTTCT TCTGAGTCTG 

4 451 TGCCTCTCTC TCTTGGAGTT AGTCTTTCCA ACCCTATATA CTTACCACTA 

4 501 TCTTCATCCC TCTACCTTCC TTTTTCCCA7 TACATCTGTG CAGTACTGGG 

4 551 TGGCAA.CTAT TGTGTTTCGG TGTTAATATC CAAGTTTCCC TGAATAAGAC 

4 601 CAAGTGAATG GAGGATGAAT GAGTATACCT ATCCCTCCAG GGGTCATCAG 

4 651 ACATATTTAG CCACCATATT TAATCAATAA GCAGGAAGAC AT AAG C TAG C 

4701 CTTGTCCTTC TTCTTTCCTC CCTGCTCCTT TCTCTTCTCT TCCCCCTCTC 

47 51 CCTTTACTGT CATCCATCAG TATTTTCAGA GCATCTATTA TGTGTCAGGC 

4 801 ATTCAGATAC TCAAACGGAG GAAAACAAGA ATAAACAAGA CAAAGATCTG 

4 8 51 ACCACAGGGG AATCCCTATG GCTACTGTAG ACTTTTGAGC CATAAAGGAA 

4 901 GAATCAAGCC TAG T G TAAAT GAAAATTCCT TAATGCTGTG CCTT^TAAAA 

4 951 AGAAATGTGA CATAAGCAAA ATGATTAGTTTCTTT CTTTA AT AAT^AriTr 

5001 CTTGAGGTAG GAGAGTGTTT TGGGATCTAT TATTAAC TCT TCTTTCCTTT 
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